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METHOD AND APPARATUS FOR DIRECT AND INDIRECT TEXTURE PROCESSING IN A GRAPHICS SYSTEM 
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This application is made by the below identified inventor(s). Attached hereto are the followinq papers* 
™ An abstract together with 

pages of specification and claims including 
numbered claims and also attached is/are 
sheets of accompanying drawings. 

This application is based on the following prior foreign applications- 
Application No. Country Filing Date 

respectively, the entire content of which is hereby incorporated by reference in this application, and priority is herebv 
claimed therefrom. 

This application is based on the following prior provisional application(s): 
q Application No. Filing Date 

jn 60/226,891 23 August 2000 

Respectively, the entire content of which is hereby incorporated by reference in this application, and priority is herebv 
^claimed therefrom. 

IjO Certified copy/ies of foreign applications attached. 

This application is a □ continuation/Q division/D continuation-in-part of application Serial No filed 
Please amend the specification by inserting before the first line: --This application is a □ continuation/!!] division/' 
U continuation-in-part of application Serial No. , filed , the entire content of which is hereby incorporated by 
reference in this application. - 

Please amend the specification by inserting before the first line: --This is a continuation of PCT application No. 

, filed , the entire content of which is hereby incorporated by reference in this application.- 
Please amend the specification by inserting before the first line: -This application claims the benefit of U S 
Provisional Application No. 60/226,891, filed 23 August 2000, the entire content of which is hereby incorporated by 
reference in this application. -- 

Preliminary amendment to claims (attached hereto), to be entered before calculation of the fee 
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Method And Apparatus For Direct and Indirect Texture Processing In A 

Graphics System 

Related Applications 

This application claims the benefit of U.S. Provisional Application, Serial 
No. 60/226,891, filed August 23, 2000, the entire content of which is hereby 
incorporated by reference. 

Field of the Invention 

The present invention relates to computer graphics, and more particularly to 
interactive graphics systems such as home video game platforms. Still more 
particularly this invention relates to direct and indirect texture mapping/processing 
in a graphics system. 

Background And Summary Of The Invention 

Many of us have seen films containing remarkably realistic dinosaurs, 
aliens, animated toys and other fanciful creatures. Such animations are made 
possible by computer graphics. Using such techniques, a computer graphics artist 
can specify how each object should look and how it should change in appearance 
over time, and a computer then models the objects and displays them on a display 
such as your television or a computer screen. The computer takes care of 
performing the many tasks required to make sure that each part of the displayed 
image is colored and shaped just right based on the position and orientation of each 
object in a scene, the direction in which light seems to strike each object, the 
surface texture of each object, and other factors. 
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Because computer graphics generation is complex, computer-generated 
three-dimensional graphics just a few years ago were mostly limited to expensive 
specialized flight simulators, high-end graphics workstations and supercomputers. 
The public saw some of the images generated by these computer systems in movies 
5 and expensive television advertisements, but most of us couldn't actually interact 
with the computers doing the graphics generation. All this has changed with the 
availability of relatively inexpensive 3D graphics platforms such as, for example, 
the Nintendo 64® and various 3D graphics cards now available for personal 
computers. It is now possible to interact with exciting 3D animations and 
10 simulations on relatively inexpensive computer graphics systems in your home or 
O office. 

rj A problem graphics system designers confronted in the past was how to 

MS 3 

j!1 create realistic looking surface detail on a rendered object without resorting to 
S; explicit modeling of the desired details with polygons or other geometric 
j\ 15 primitives. Although surface details can be simulated, for example, using myriad 
£ small triangles with interpolated shading between vertices, as the desired detail 
BP becomes finer and more intricate, explicit modeling with triangles or other 
O primitives places high demands on the graphics system and becomes less practical. 
An alternative technique pioneered by E. Catmull and refined by J. F. Blinn and M. 
20 E. Newell is to "map" an image, either digitized or synthesized, onto a surface. 
(See "A Subdivision Algorithm for Computer Display of Curved Surfaces" by E. 
Catmull, Ph.D. Thesis, Report UTEC-CSc-74-133, Computer Science Department, 
University of Utah, Salt Lake City, UT, December 1994 and 'Texture and 
Reflection in Computer Generated Images" by J. F. Blinn and M. E. Newell, 
25 CACM, 19(10), October 1976, 452-457). This approach is known as texture 
mapping (or pattern mapping) and the image is called a texture map (or simply 
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referred to as a texture). Alternatively, the texture map may be defined by a 
procedure rather than an image. 

Typically, the texture map is defined within a 2D rectangular coordinate 
space and parameterized using a pair of orthogonal texture coordinates such, as for 
5 example, (u, v) or (s, t). Individual elements within the texture map are often 
called texels. At each rendered pixel, selected texels are used either to substitute 
for or to scale one or more material properties of the rendered object surface. This 
process is often referred to as texture mapping or "texturing." 

Most 3-D graphics rendering systems now include a texturing subsystem for 
10 retrieving textures from memory and mapping the textures onto a rendered object 
£ surface. Sophisticated texturing effects utilizing indirect or multiple textures are 
m also possible such as, for example, multi-texturing, meta-textures or texture tiling, 
y but conventional approaches typically involve complex hardware arrangements 
j*j such as using multiple separate texture retrieval/mapping circuits (units) where the 
^15 output of one texturing circuit provides the input to a next texturing circuit. Such 
J duplicated circuitry is essentially idle whenever such effects are not used. In on- 
g chip graphics processing implementations, the additional circuitry requires more 
y chip real-estate, can reduce yield and reliability, and may significantly add to the 
overall production cost of the system. Consequently, a further problem 
20 confronting graphics system designers is how to efficiently implement these more 
sophisticated texturing effects without associated increases in texture mapping 
hardware complexity. 

One solution is to use a single texture addressing/mapping circuit and 
perform multiple texturing passes. Nominally, this may require at least generating 
25 a first set of texture addressing coordinates, accessing a first texture, storing the 
data retrieved in a temporary storage, and then regenerating the same set of texture 
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coordinates again for use in computing new coordinates when accessing a second 
texture in the next or a subsequent texturing pass. Although this approach may 
reduce hardware complexity somewhat, it is fairly time consuming, requires 
generating/providing the same set of texture coordinates multiple times, and results 
5 in inefficient processing during mode changes (e.g., switching between direct and 
indirect texturing operational modes). Moreover, this approach results in a very 
course granularity in the data processing flow through the graphics rendering 
system — significantly affecting polygon fill rate. 

To solve this problem and to provide an enhanced repertoire of texturing 
10 capabilities for a 3-D graphics system, the present invention provides a versatile 
^ texturing pipeline arrangement achieving a relatively low chip-footprint by 

utilizing a single texture address coordinate/data processing unit that interleaves 
]r* the processing of logical direct and indirect texture coordinate data and provides a 
^ texture lookup data feedback path for "recirculating" retrieved indirect texture 
j\. 15 lookup data from a single texture retrieval unit back to the texture address 
!r; coordinate/data processing unit. The interleaved coordinate processing and 
g recirculated/feedback data arrangement of the present invention allow efficient 
0 processing of any number of logical direct and/or indirect texture mapping stages 
from a smaller number of hardware texture processing units while preserving a fine 
20 granularity in the overall data processing flow. 

In accordance with one aspect provided by the present invention, the 
recirculating/data-feedback arrangement of the texturing pipeline portion of the 
graphics processing enables efficient use and reuse of a single texture lookup 
(retrieval) unit for both logical direct and indirect texture processing without 
25 requiring multiple rendering passes and/or temporary texture storage hardware. 
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In accordance with another aspect provided by the invention, the texture 
address (coordinate) processing hardware is arranged to perform various 
coordinate computations based on the recirculated/feedback texture data and to 
process both direct and indirect coordinate data together in a substantially 
5 continuous interleaved flow (e.g., to avoid any "course granularity M in the 

processing flow of graphics data throughout the system). This unique interleaved 
processing/data-recirculating texture pipeline arrangement enables efficient and 
flexible texture coordinate processing and texture retrieval/mapping operations 
while using a minimum amount of hardware for providing an enhanced variety of 
10 possible direct and indirect texturing applications. 

In accordance with another aspect provided by this invention, an effectively 
continuous processing of coordinate data for performing logical direct and indirect 
texture lookups is achieved by interleaving the processing of both direct and 
indirect coordinate data per pixel within a single texture coordinate processing 
15 hardware unit. For example, a selector can be used to look for "bubbles'' (unused 
cycles) in the indirect texture coordinate stream, and to insert computed texture 
coordinate data in such "bubbles" for maximum utilization of the texture mapper. 

In accordance with yet another aspect provided by the invention, a hardware 
implemented texturing pipeline includes a texture lookup data feedback path by 
20 which the same texture data retrieval unit can be used and reused to: 

• both lookup direct indirect textures, and 

• supply indirect texture lookup data. 

The same texture address (coordinate) processing unit can be used and reused for 
processing both logical direct and indirect texture coordinate data and computing 
25 new/modified texture coordinates. 
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In accordance with yet another aspect provided by this invention, a set of 
texture mapping parameters is presented to a texture mapping unit which is 
controlled to perform a texture mapping operation. The results of this texture 
mapping operation are recirculated and used to present a further set of texture 
mapping parameters which are fed back to the input of the same texture mapping 
unit. The texture mapping unit performs a further texture mapping operation in 
response to these recirculated parameters to provide a further texture mapping 
result. 

The first texture mapping operation may comprise an indirect texture 
mapping operation, and a second texture mapping operation may comprise a direct 
texture mapping operation. The processing and presentation of texture mapping 
parameters to a texture mapping unit for performing direct texture mapping 
operations may be interleaved with the processing and presentation of texture 
mapping parameters for performing indirect direct texture mapping operations. 

In accordance with a further aspect provided by this invention, a method of 
indirect texture referencing uses indirect texture coordinates to generate a data 
triplet which is then used to derive texture coordinates. The derived texture 
coordinates are then used to map predetermined texture data onto a primitive. In 
accordance with yet a further aspect provided by the invention, the retrieved data 
triplet stored in texture memory is used to derive a set of modified texture 
coordinates which are then used to reference texture data stored in the texture 
memory corresponding to a predetermined texture. 

In accordance with yet another aspect provided by this invention, a graphics 
system includes: 
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• a texture coordinate/data processing unit for interleaved processing of 
logical direct and indirect coordinate data comprising an arrangement of 
at least one data multiplier and at least one data accumulator; 

• a texture data retrieval unit connected to the coordinate/data processing 
5 unit, the texture data retrieval unit retrieving texture data stored in a 

texture memory; and 

• a data feedback path from a texture data retrieval unit to the texture 
coordinate/data processing unit to recycle retrieved texture data through 
the texture coordinate/data processing unit for further processing; 

10 • wherein in response to a set of texture coordinates the retrieval unit 

provides retrieved texture data to the processing unit for deriving 
modified texture coordinates which are used in mapping a texture to a 
surface of a rendered image object. 
In accordance with yet another aspect provided by this invention, a texture 
15 processing system for selectively mapping texture data corresponding to one or 
more different textures and/or texture characteristics to surfaces of rendered and 
displayed images includes a texture coordinate offset matrix arrangement 
producing a set of offset texture coordinates by multiplying indirect texture data by 
elements of a matrix, wherein one or more elements of the matrix are a 
20 mathematical function of one or more predetermined direct texture coordinates and 
one or more elements of the matrix can be selectively loaded. 

In accordance with yet another aspect provided by this invention, a set of 
indirect texture coordinates are used to retrieve data triplets stored in texture 
memory, and a set of modified texture coordinates are derived based at least in part 
25 on the retrieved data triplets. The set of modified texture coordinates is then used 
for retrieving data stored in texture memory. These steps are reiteratively repeated 
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for a predetermined number of data retrievals, and a set of derived texture 
coordinates resulting from the repetition is used to map predetermined texture data 
onto a primitive. 

In accordance with yet another aspect provided by the invention, a set of 
5 generalized API (application program interface) indirect texture mapping functions 
are defined and supported by the texturing pipeline apparatus which permits 
specifying arguments for performing at least four indirect-texture operations 
(indirect lookup stages) and for selectively associating one of at least eight pre- 
defined textures and one of at least eight pre-defined sets of texture coordinates 
10 with each indirect texturing operation. The defined API indirect texture mapping 
functions also permit specifying texture scale, bias and coordinate wrap factors as 
well as a variety of texture coordinate offset multiplication matrix configurations 
and functions for computing new/modified texture lookup coordinates within the 
texturing pipeline. 

15 In accordance with yet a further aspect provided by the invention, a texture 

address (coordinate) processing unit transforms retrieved texture color/data from 
an indirect texture lookup into offsets that are added to the texture coordinates of a 
regular (non-direct) texture lookup. The feedback path provides texture color/data 
output from a texture retrieval unit to a texture coordinate processing unit used to 

20 generate/provide texture coordinates to the texture retrieval unit. 

In accordance with yet a further aspect provided by the invention, a single 
texture address processing unit comprising at least a pair of FIFO buffers is utilized 
for interleaving and synchronizing the processing of both "direct" (regular non- 
indirect) and "indirect" texture coordinates, and a single texture data retrieval unit 
25 is used for retrieving and recirculating indirect-texture lookup data back to the 
texture address processing unit for computing new/modified texture lookup 
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coordinates. In an example embodiment, the retrieved indirect-texture lookup data 
is processed as multi-bit binary data triplets of three, four, five, or eight bits. The 
data triplets are multiplied by a 3 X 2 element texture coordinate offset matrix 
before being optionally combined with direct coordinate data, or with computed 
5 data from a previous cycle/stage of texture address processing, to compute 

modified offset texture coordinates for accessing a texture map in main memory. 
Values of the offset matrix elements are programmable and may be dynamically 
defined for successive processing cycles/stages using selected predetermined 
constants or values based on direct coordinates. A variety of offset matrix 
10 configurations are selectable including at least three offset matrix configurations 
containing elements based on programmable constants and two "variable" matrix 
configurations containing elements based on a values from a set of direct texture 
coordinates. Circuitry for optionally biasing and scaling retrieved texture data is 
also provided 

15 Brief Description Of The Drawings 

These and other features and advantages provided by the invention will be 
better and more completely understood by referring to the following detailed 
description of presently preferred embodiments in conjunction with the drawings. 
The file of this patent contains at least one drawing executed in color. Copies of 
20 this patent with color drawing(s) will be provided by the Patent and Trademark 
Office upon request and payment of the necessary fee. The drawings include the 
following figures: 

Figure 1 is an overall view of an example interactive computer graphics 
system; 

25 Figure 2 is a block diagram of the Figure 1 example computer graphics 

system; 



Figure 3 is a block diagram of the example graphics and audio processor 
shown in Figure 2; 

Figure 4 is a block diagram of the example 3D graphics processor shown in 
Figure 3; 

5 Figure 5 is an example logical flow diagram of the Figure 4 graphics and 

audio processor; 

Figure 6 is block diagram illustrating a logical overview of indirect texture 
processing in accordance with the present invention; 

Figure 7A is a functional block diagram illustrating a simple basic example 
D 10 of a regular (non-indirect) texture lookup; 

i{5 Figure 7B is a functional block diagram illustrating a simple basic example 

of an indirect texture lookup in accordance with the present invention; 

fU Figure 8 is a block diagram illustrating an overview of an example physical 

^ configuration for implementing indirect texture processing in accordance with the 
1 5 present invention ; 

O Figure 9 is a block diagram illustrating a logical overview of the texture 

address (coordinate/data) processor operation; 

Figures 10A-10K are a series of block diagrams illustrating the dynamic 
progression of direct and indirect data in the example texturing pipeline 
20 implementation to provide interleaved direct and indirect texture processing; 

Figure 1 1 is a flow chart illustrating example steps for implementing indirect 
texture processing in accordance with the present invention; 

Figure 12 is a functional operations diagram illustrating an example of 
regular (non-indirect) texture processing in accordance with the present invention: 
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Figure 13 is a functional operations diagram illustrating an example of 
interleaved regular (non-indirect) and indirect texture processing in accordance 
with the present invention; 

Figure 14 is a block diagram showing a detailed example of the texture 
coordinate/bump processing unit shown in Figure 5; 

Figure 15 is a block diagram showing a detailed example of the indirect 
texture lookup data/coordinate processing logic (proc) shown in Figure 14; 

Figures 16A and 16B show example texture offset matrices used by 
processing logic circuit (proc) of Figure 15; 

Figure 17 is a block diagram illustrating example data field formats of 
control logic registers for controlling the operations within the processing circuitry 
of Figure 15; 

Figures 18 and 19 show example alternative compatible implementations. 

Detailed Descripti on Of Example Embodiments Of The Invention 

Figure 1 shows an example interactive 3D computer graphics system 50. 
System 50 can be used to play interactive 3D video games with interesting stereo 
sound. It can also be used for a variety of other applications. 

In this example, system 50 is capable of processing, interactively in real 
time, a digital representation or model of a three-dimensional world. System 50 
can display some or all of the world from any arbitrary viewpoint. For example, 
system 50 can interactively change the viewpoint in response to real time inputs 
from handheld controllers 52a, 52b or other input devices. This allows the game 
player to see the world through the eyes of someone within or outside of the 
world. System 50 can be used for applications that do not require real time 3D 
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interactive display (e.g., 2D display generation and/or non-interactive display), but 
the capability of displaying quality 3D images very quickly can be used to create 
very realistic and exciting game play or other graphical interactions. 

To play a video game or other application using system 50. the user first 
5 connects a main unit 54 to his or her color television set 56 or other display device 
by connecting a cable 58 between the two. Main unit 54 produces both video 
signals and audio signals for controlling color television set 56. The video signals 
are what controls the images displayed on the television screen 59. and the audio 
signals are played back as sound through television stereo loudspeakers 61L, 61R. 

10 The user also needs to connect main unit 54 to a power source. This power 

^0 source may be a conventional AC adapter (not shown) that plugs into a standard 
rU home electrical wall socket and converts the house current into a lower DC voltage 
M signal suitable for powering the main unit 54. Batteries could be used in other 
ry implementations. 

5^15 The user may use hand controllers 52a. 52b to control main unit 54. 

W Controls 60 can be used, for example, to specify the direction (up or down, left or 
0 right, closer or further away) that a character displayed on television 56 should 
move within a 3D world. Controls 60 also provide input for other applications 
(e.g., menu selection, pointer/cursor control, etc.). Controllers 52 can take a 
20 variety of forms. In this example, controllers 52 shown each include controls 60 
such as joysticks, push buttons and/or directional switches. Controllers 52 may be 
connected to main unit 54 by cables or wirelessly via electromagnetic (e.g., radio 
or infrared) waves. 

To play an application such as a game, the user selects an appropriate 
25 storage medium 62 storing the video game or other application he or she wants to 
play, and inserts that storage medium into a slot 64 in main unit 54. Storage 



medium 62 may, for example, be a specially encoded and/or encrypted optical 
and/or magnetic disk. The user may operate a power switch 66 to turn on main 
unit 54 and cause the main unit to begin running the video game or other 
application based on the software stored in the storage medium 62. The user may 
5 operate controllers 52 to provide inputs to main unit 54. For example, operating a 
control 60 may cause the game or other application to start. Moving other controls 
60 can cause animated characters to move in different directions or change the 
user's point of view in a 3D world. Depending upon the particular software stored 
within the storage medium 62, the various controls 60 on the controller 52 can 
10 perform different functions at different times. 

Jj Example Electronics of Overall System 

i! r„£ 

lit Figure 2 shows a block diagram of example components of system 50. The 

m primary components include: 

f • a main processor (CPU) 1 10, 

^ 15 • a main memory 112, and 

2 • a graphics and audio processor 1 14. 

Q In this example, main processor 1 10 (e.g., an enhanced IBM Power PC 750) 

receives inputs from handheld controllers 108 (and/or other input devices) via 
graphics and audio processor 1 14. Main processor 1 10 interactively responds to 
20 user inputs, and executes a video game or other program supplied, for example, by 
external storage media 62 via a mass storage access device 106 such as an optical 
disk drive. As one example, in the context of video game play, main processor 110 
can perform collision detection and animation processing in addition to a variety of 
interactive and control functions. 
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In this example, main processor 1 10 generates 3D graphics and audio 
commands and sends them to graphics and audio processor 1 14. The graphics and 
audio processor 114 processes these commands to generate interesting visual 
images on display 59 and interesting stereo sound on stereo loudspeakers 61R, 61L 
5 or other suitable sound-generating devices. 

Example system 50 includes a video encoder 120 that receives image signals 
from graphics and audio processor 1 14 and converts the image signals into analog 
and/or digital video signals suitable for display on a standard display device such 
as a computer monitor or home color television set 56. System 50 also includes an 

10 audio codec (compressor/decompressor) 122 that compresses and decompresses 
digitized audio signals and may also convert between digital and analog audio 
signaling formats as needed. Audio codec 122 can receive audio inputs via a 
buffer 124 and provide them to graphics and audio processor 1 14 for processing 
(e.g., mixing with other audio signals the processor generates and/or receives via a 

15 streaming audio output of mass storage access device 106). Graphics and audio 
processor 114 in this example can store audio related information in an audio 
memory 126 that is available for audio tasks. Graphics and audio processor 114 
provides the resulting audio output signals to audio codec 122 for decompression 
and conversion to analog signals (e.g., via buffer amplifiers 128L, 128R) so they 

20 can be reproduced by loudspeakers 61L, 61R. 

Graphics and audio processor 114 has the ability to communicate with 
various additional devices that may be present within system 50. For example, a 
parallel digital bus 130 may be used to communicate with mass storage access 
device 106 and/or other components. A serial peripheral bus 132 may 
25 communicate with a variety of peripheral or other devices including, for example: 

• a programmable read-only memory and/or real time clock 134, 
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• a modem 136 or other networking interface (which may in turn connect 
system 50 to a telecommunications network 138 such as the Internet or 
other digital network from/to which program instructions and/or data can 
be downloaded or uploaded), and 

5 • flash memory 140. 

A further external serial bus 142 may be used to communicate with additional 
expansion memory 144 (e.g., a memory card) or other devices. Connectors may be 
used to connect various devices to busses 130, 132, 142. 

Example Graphics And Audio Processor 

0 10 Figure 3 is a block diagram of an example graphics and audio processor 1 14. 

SI Graphics and audio processor 1 14 in one example may be a single-chip ASIC 

m (application specific integrated circuit). In this example, graphics and audio 

(B processor 1 14 includes: 

f s • a processor interface 1 50, 

J: 15 • a memory interface/controller 152, 

• a 3D graphics processor 154, 

° • an audio digital signal processor (DSP) 156, 

• an audio memory interface 158, 

• an audio interface and mixer 160, 
20 • a peripheral controller 162, and 

• a display controller 164. 

3D graphics processor 154 performs graphics processing tasks. Audio 
digital signal processor 156 performs audio processing tasks. Display controller 
164 accesses image information from main memory 112 and provides it to video 
25 encoder 120 for display on display device 56. Audio interface and mixer 160 
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interfaces with audio codec 122, and can also mix audio from different sources 
(e.g., streaming audio from mass storage access device 106, the output of audio 
DSP 156, and external audio input received via audio codec 122), Processor 
interface 150 provides a data and control interface between main processor 1 10 
5 and graphics and audio processor 114. 

Memory interface 152 provides a data and control interface between 
graphics and audio processor 1 14 and memory 1 12. In this example, main 
processor 110 accesses main memory 112 via processor interface 150 and memory 
interface 152 that are part of graphics and audio processor 114. Peripheral 
10 controller 162 provides a data and control interface between graphics and audio 
processor 1 14 and the various peripherals mentioned above. Audio memory 

v » 

■'!~s 

fg] interface 158 provides an interface with audio memory 126. 

CO Example Graphics Pipeline 

: Figure 4 shows a more detailed view of an example 3D graphics processor 

H; 15 154. 3D graphics processor 154 includes, among other things, a command 

fu 

m processor 200 and a 3D graphics pipeline 180. Main processor 1 10 communicates 
O streams of data (e.g., graphics command streams and display lists) to command 
processor 200. Main processor 1 10 has a two-level cache 115 to minimize 
memory latency, and also has a write-gatheiing buffer 1 1 1 for non-cached data 
20 streams targeted for the graphics and audio processor 1 14. The write-gathering 
buffer 111 collects partial cache lines into full cache lines and sends the data out to 
the graphics and audio processor 1 14 one cache line at a time for maximum bus 
usage. 

Command processor 200 parses display commands received from main 
25 processor 110 — obtaining any additional data necessary to process the display 
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commands from shared memory 112. The command processor 200 provides a 
stream of vertex commands to graphics pipeline 180 for 2D and/or 3D processing 
and rendering. Graphics pipeline 180 generates images based on these commands. 
The resulting image information may be transferred to main memory 1 12 for 
5 access by display controller/ video interface unit 164 -- which displays the frame 
buffer output of pipeline 180 on display 56. 

Figure 5 is a logical flow diagram of graphics processor 154. Main 
processor 1 10 may store graphics command streams 210. display lists 212 and 
vertex arrays 214 in main memory 1 12, and pass pointers to command processor 
10 200 via bus interface 150. The main processor 110 stores graphics commands in 
g one or more graphics first-in-first-out (FIFO) buffers 2 10 it allocates in main 
pj memory 1 10. The command processor 200 fetches: 

pi • command streams from main memory 1 12 via an on-chip FIFO memory 

fy buffer 216 that receives and buffers the graphics commands for 

u 1 5 synchronization/flow control and load balancing, 

j*j • display lists 212 from main memory 1 12 via an on-chip call FIFO 

g memory buffer 2 1 8 , and 

• vertex attributes from the command stream and/or from vertex arrays 2 14 
in main memory 1 12 via a vertex cache 220. 

20 Command processor 200 performs command processing operations 200a 

that convert attribute types to floating point format, and pass the resulting complete 
vertex polygon data to graphics pipeline 180 for rendering/rasterization. A 
programmable memory arbitration circuitry 130 (see Figure 4) arbitrates access to 
shared main memory 112 between graphics pipeline 180, command processor 200 

25 and display controller/video interface unit 164. 



18 



Figure 4 shows that graphics pipeline 180 may include: 

• a transform unit 300, 

• a setup/rasterizer 400. 

• a texture unit 500, 

5 • a texture environment unit 600, and 

• a pixel engine 700. 

Transform unit 300 performs a variety of 2D and 3D transform and other 
operations 300a (see Figure 5). Transform unit 300 may include one or more 
matrix memories 300b for storing matrices used in transformation processing 300a. 
^ 10 Transform unit 300 transforms incoming geometry per vertex from object space to 
J3 screen space; and transforms incoming texture coordinates and computes 

FU projective texture coordinates (300c). Transform unit 300 may also perform 

m 

M polygon clipping/culling 300d. Lighting processing 300e also performed by 

ill transform unit 300b provides per vertex lighting computations for up to eight 

15 independent lights in one example embodiment. Transform unit 300 can also 

ry perform texture coordinate generation (300c) for embossed type bump mapping 

p effects, as well as polygon clipping/culling operations (300d). 

Setup/rasterizer 400 includes a setup unit which receives vertex data from 
transform unit 300 and sends triangle setup information to one or more rasterizer 
20 units (400b) performing edge rasterization, texture coordinate rasterization and 
color rasterization. 

Texture unit 500 (which may include an on-chip texture memory (TMEM) 
502) performs various tasks related to texturing including for example: 

• retrieving textures 504 from main memory 112, 
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• texture processing (500a) including, for example, multi-texture handling, 
post-cache texture decompression, texture filtering, embossing, shadows 
and lighting through the use of projective textures, and BLIT with alpha 
transparency and depth, 

• bump map processing for computing texture coordinate displacements for 
bump mapping, pseudo texture and texture tiling effects (500b), and 

• indirect texture processing (500c). 

Graphics pipeline 180 includes a versatile texturing pipeline architecture that 
facilitates the implementation of various direct and indirect texturing features. As 
shown in Figure 5, the texturing pipeline basically comprises: 

• texture unit 500a for performing texture data lookup retrieval, 

• indirect texture/bump units 500b/500c for texture coordinate/texture data 
processing, 

• texture lookup data feedback path 500d and 

• texture environment unit 600 for staged color data and alpha 
(transparency data) blending. 

Reuse of units 500a, 500b, 500c can be used to provide a variety of 
interesting effects including multitexturing for example. Furthermore, the present 
invention supports indirect texturing through reuse/recirculation of these 
components. In an example hardware implementation, texture address 
coordinate/bump processing block 500b and indirect texture data processing block 
500c are portions of a single texture coordinate/data processing unit and the 
texturing pipeline is configured so as to allow retrieved texture indirect lookup data 
from texture unit 500a to be provided back via data feedback connection 500d to 



texture address coordinate/bump processor 500b/500c. The texture coordinate/data 
processing unit transforms texture data retrieved from an indirect texture lookup 
into offsets that are then added to texture coordinates for another (regular/non- 
indirect) texture lookup. 

5 Using the above described feedback path arrangement, retrieved texture data 

can effectively be "recirculated 11 back into the texture processing pipeline for 
further processing/computation to obtain new/modified texture lookup coordinates. 
This recirculated/recycled texture lookup data arrangement enables efficient and 
flexible indirect texture mapping/processing operations providing an enhanced 
10 variety of indirect texture applications. A few of the various applications of 
3 indirect texture mapping/processing which the texturing pipeline can provide 
include, for example: 

uJ • Texture warping 

fy • Meta-textures 

y, 15 • Texture tile maps 

^ • Pseudo-3D textures 

• Environment-mapped bump mapping 



Texture unit 500 outputs filtered texture values to the texture environment 
20 unit 600 for texture environment processing (600a). Texture environment unit 
(TEV) 600 blends polygon and texture color/alpha/depth, and can also perform 
texture fog processing (600b) to achieve inverse range based fog effects. Texture 
environment unit 600 can provide multiple stages to perform a variety of other 
interesting environment-related functions based for example on color/alpha 
25 modulation, embossing, detail texturing, texture swapping, clamping, and depth 
blending. Texture environment unit 600 can also combine (e.g., subtract) textures 
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in hardware in one pass. For more details concerning the texture environment unit 

600, see commonly assigned application serial no. , entitled "Recirculating 

Shade Tree Blender for a Graphics System" (attorney docket no. 723-968) and its 
corresponding provisional application, serial no. 60/226,888, filed August 23, 
5 2000, both of which are incorporated herein by reference. 

Pixel engine 700 performs depth (z) compare (700a) and pixel blending 
(700b). In this example, pixel engine 700 stores data into an embedded (on-chip) 
frame buffer memory 702. Graphics pipeline 180 may include one or more 
embedded DRAM memories 702 to store frame buffer and/or texture information 
10 locally. Depth (z) compares can also be performed at an earlier stage 700a' in the 
2 graphics pipeline 180 depending on the rendering mode currently in effect (e.g., z 
!<y compares can be performed earlier if alpha blending is not required). The pixel 
y engine 700 includes a copy operation 700c that periodically writes on-chip frame 
Jfj buffer 702 to main memory 1 12 for access by display/video interface unit 164. 
[.15 This copy operation 700c can also be used to copy embedded frame buffer 702 
^ contents to textures in the main memory 1 12 for dynamic texture synthesis effects. 
*~ Anti-aliasing and other filtering can be performed during the copy-out operation. 
O The frame buffer output of graphics pipeline 1 80 (which is ultimately stored in 

main memory 1 12) is read each frame by display/video interface unit 164. Display 
20 controller/video interface 164 provides digital RGB pixel values for display on 
display 102. 

Example Indirect Texturing - Logical Overview 

Figure 6 shows a logical overview of indirect texturing supported by system 
50. In accordance with the logical overview as illustrated by Figure 6, a rasterizer 
25 6000 may generate N sets of indirect texture addresses (coordinates), ADDR_A0 
through ADDR_A(N-1), and M sets of direct texture addresses (coordinates), 
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ADDRJBO through ADDR_B(M-1). The N sets of indirect texture coordinates, 
ADDR_A0 through ADDR_A(N-1), are passed to N corresponding logical texture 
lookup units 6002, 6004, 6006 (AO through A(N-l)). Each logical texture lookup 
unit (which, in one example implementation, is provided by reusing the same 
5 physical texture lookup units N times) uses the received indirect texture 

coordinates to look-up (retrieve) a texel value from a corresponding texture map — 
each of which may be a unique and different texture map. These lookup operations 
result in N sets of indirect texture lookup values, DATA_A0 through DATA_A(N- 
1), that are provided to a texture address processor (6008). Texture address 

10 processor 6008 also receives M sets of direct texture coordinate inputs. ADDR_A0 
to ADDR_A(N-1). 

The Texture Address Processor 6008 computes K sets of new/modified 
direct texture addresses (coordinates), ADDR_C0 through ADDR_C(K-1), based 
upon a predetermined function of the indirect texture lookup data values and the 

15 direct texture coordinates. Each of the K computed sets of direct texture 
coordinates (addresses), ADDR_C0 through ADDR_C(K-1), is passed to 
corresponding logical texture lookup units CO (6010) and CI (6012) through 
C(K-l) (6014). On one example implementation, these logical texture units 
CO-C(K-l) can be provided by reusing the same physical texture mapper used to 

20 provide logical texture units AO-A(N-l). Each texture lookup unit, CO through 
C(K-l), uses the received coordinates to look-up a texel value in a corresponding 
texture map. 

K sets of texture lookup values, DATA_C0 through DATA_C(K-1), 
resulting from the texture lookups are then provided to a pixel shader (6016). Pixel 
25 Shader 6004 receives the K sets of received texture values, along with zero, one, or 
more sets of rasterized (Gouraud shaded) colors. Pixel Shader 6016 then uses the 
received texture values, DATA_C0 to DATA_C(K-1), according to a 
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predetermined shading function to produce color output values that may be passed, 
for example, to a video display frame buffer. 

To aid in understanding, Figures 7 A and 7B, respectively, illustrate 
simplified examples of a regular (non-indirect) texture lookup operation and an 
5 indirect texture lookup operation. As shown in Figure 7A, a regular texture 
mapping lookup operation 950 may require specifying at least a set of regular 
(non-indirect) texture coordinates and a texture map ID as inputs. For this 
example, texture color data is retrieved (from the appropriate texture map) by 
texture unit 500a (Figure 5) and then provided to texture environment unit (TEV) 
1 0 600a for color blending . 

J In an example indirect texture lookup operation, as illustrated in Figure 7B, 

iy a texture mapping occurs in multiple stages/cycles. During a first stage, a set of 
n indirect texture coordinates are provided as inputs 901 along with a texture map ID 

corresponding to an indirect-texture mapping operation 952. In the next, or a 
[ 1 5 subsequent stage, data retrieved from the indirect texture lookup 952 is used in 
£ conjunction with a set of regular (non-indirect) texture coordinates 903 to produce 
J a set of new/modified coordinates 905 for another (regular) texture lookup 950 
0 using input texture map ID 907. During each stage, various programmable 

operations 909 may be performed on the retrieved data and coordinates to obtain 
20 the new/modified lookup coordinates. In one example implementation, blocks 952 

and 950 in the Figure 7B example are performed by the same hardware that is 

reused or recycled in a staged or recirculating manner. 

Example Direct and Indirect Texture Addressing 

Figure 8 shows an example basic physical configuration for implementing 
25 direct and indirect texturing while providing reuse of a physical texture mapping 
unit(s) in an interleaved fashion. Rasterizer 7000 sequentially generates all the 



24 



direct and indirect texture address coordinate sets associated with each pixel. In an 
example implementation, rasterizer 7000 provides coordinate sets in parallel for 
four pixels in a pixel tile (i.e., a 2x2 or other pixel region), but other 
implementations are possible. In the example shown, rasterizer 7000 first 
5 generates indirect texture coordinate sets per pixel (e.g., ADDR_A0 through 
ADDR_A(N-1)), followed by all direct texture coordinate sets per pixel (e.g., 
ADDR-B0 through ADDR_B(M-1)). Each pixel may have differing amounts of 
direct and indirect coordinate data associated with it depending on the texturing 
operations being performed. For example, certain pixels may not require indirect 
10 texture addressing and will not have associated indirect coordinate data while other 
pixels may require one or more indirect texture addressing operations and will 
have one or more corresponding sets of associated direct and indirect coordinates. 

In an example implementation of the texture processing circuitry of the 
graphics pipeline 180, texture processing is accomplished utilizing the same 

15 texture address processor and the same texture retrieval unit. To maximize 

efficient use of the texture processing hardware and avoid coarse granularity in the 
overall data processing flow through the pipeline, the processing of logical direct 
and indirect texture addresses (coordinates) and the lookup (retrieval) of texture 
data is performed in a substantially continuous and interleaved fashion. Indirect 

20 texture coordinate sets generated by rasterizer 7000 per pixel are passed directly to 
a single texture retrieval unit 7012 via switches SO (7002) and SI (7010), while 
non-indirect (logical direct) coordinate sets are placed in Direct Coordinate FIFO 
(dFIFO) 7006. 

In an example implementation of the graphics pipeline, texture retrieval unit 
25 7008 operates on at least one texture per clock and is capable of handling multiple 
texturing contexts simultaneously by maintaining state information and cache 
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storage for more than one texture. Retrieved indirect texture data, DATA_A0 
through DATA_A(N-1), is passed via feedback path 7018 to Indirect Data FIFO 
(iFIFO) 7004 via switch S2, where the retrieved indirect texture data is stored until 
needed. Direct texture coordinates are passed to Direct Coordinate FIFO (dFIFO) 
5 7006 via switch SO where they are stored until needed. In the above example 
discussed with respect to Figure 6, Indirect Data FIFO 7004 would receive the N 
sets of indirect texture data and Direct Coordinate data FIFO 7006 would receive 
the M sets of direct texture coordinates. Texture Address Processor 7008 would 
subsequently compute K new sets of texture coordinates based on a predetermined 
10 function of the input direct texture coordinates and the retrieved indirect texture 
O data. Since processing of logical direct and indirect coordinates is interleaved, 
^1 whenever there is more than one direct texture operation intervening between 
ft] successive indirect texture operations, the processing of direct coordinates may lag 
ffl behind with respect to the retrieval of corresponding indirect data. Consequently, 
7 15 buffers (e.g., FIFOs) 7004 and 7006 are provided to allow 
U synchronization/realignment of retrieved indirect texture lookup data with the 
g appropriate corresponding set of direct coordinates prior to both being provided 
S simultaneously to texture address processor 7008. 

The computed K sets of texture coordinates. ADDR_C0 through 
20 ADDR_C(K- 1 ) are output sequentially over K clocks. Switch S 1 (70 1 0) 

interleaves the computed texture coordinate data (sets) into the incoming indirect 
texture coordinate stream for providing to texture unit 7012. It does this by 
looking for unused or idle cycles ("bubbles") in the incoming indirect texture 
coordinate stream, and inserting the computed texture coordinate data (sets) during 
25 these cycles. Switch S2 (7014) routes the resulting texture lookup data, DATA_C0 
to DATA_C(K-1), as well as the rasterized colors to a pixel shader 7016. Pixel 
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shader (TEV) 7016 applies a predetermined shading function and outputs a single 
set of color values which may then be passed, for example, to a video display 
frame buffer. 

In an example hardware implementation, the operation of the texture address 
5 processor may be simplified by utilizing the following two exemplary operational 
constraints: 

1) The number of sets of computed texture coordinates, K, is equal to the 
number of sets of rasterized direct texture coordinates, M; and 

2) The value of a computed texture coordinate set, ADDR_C[i], is a 

p 10 function, f(a, b, c), of a direct texture coordinate set, ADDR_B[i], and one set of 
5 indirect texture data, DATA_A(j], together (optionally) with the computed result 

gas r t 

JJj from the previous processing stage, ADDR_C[i-l]. This operational relationship 
IS may be represented by the following equation: 

7 ADDR_C[i] = f(ADDR_B[i]4)ATA_A[j]^DDR_C[i-l]) 

q Figure 9 shows a logical diagram implementation of texture address 

(coordinate/data) processor 7008 according to the above operational relationship. 
Buffer 7009 stores and optionally provides a computed texture coordinate result 
from the previous cycle. Careful FIFO addressing ensures that the correct indirect 
20 texture DATA_A value is available at the proper processing cycle/stage. 

Example Interleaved Processing In Texture Processing Pipeline 

Figures 10A-10K illustrate the dynamic, interleaved operation of the Figure 
8 arrangement. These figures show a series of block diagrams illustrating an 
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example of the relative progression of pixel direct coordinate data and pixel 
indirect texture data at successive processing stages as a result of interleaved direct 
and indirect texture processing in the above example recirculating texturing 
pipeline embodiment of Figure 8. In this example illustration, a first pixel, PXO, 
5 requires an indirect texturing operation. As shown in Figure 10A, a set of indirect 
texture coordinates (PXO IND), and a corresponding set of direct texture 
coordinates (PXO DIR), associated with pixel PXO are provided by rasterizer (400) 
and shown as just entering the texturing pipeline at switch SO. 

In Figure 10B, indirect coordinates PXO IND for pixel PXO are provided 
10 directly to texture unit 500a via switches SO and SI. Direct coordinates PXO DIR 
J for pixel PXO, which follow the direct coordinates for pixel PXO, are provided via 
llj switch SO to Direct FIFO (dFIFO) 7004 for temporary storage while the indirect 
1* coordinates are processed. The texture unit 7012 performs an indirect texture 
jj lookup based on indirect coordinates PXO IND for pixel PXO to provide computed 
* 15 data (see Figure 10C) and, as shown next in Figure 10D, provides (recirculates) the 
jf; retrieved indirect texture lookup data, PXO DATA, back to the texturing pipeline 
to input, for example, via switch S2. As shown in Figure 10D, the recirculated 
P indirect texture lookup data, PXO DATA, is provided to indirect FIFO (iFIFO) 
where it is effectively paired (through synchronization between buffers 7004, 
20 7006) with the associated direct texture coordinate set, PXO DIR. The indirect 
texture lookup data, PXO DATA, and the direct texture coordinate set, PXO DIR, 
are ready to be processed together by the texture address processor to compute a 
new/modified texture address coordinate set for pixel PXO. The texturing pipeline 
consumes this pair of values from buffers 7004, 7006 by computing texture 
25 coordinates (see Figure 10E) which are used to map a texture and provide 
color/alpha texel data to the shader (see Figure 10F). 
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Referring back to Figure 10D ? suppose that the rasterizer continues to 
provide the texturing pipeline input at switch SO with sets of texture coordinates 
for texturing subsequent pixels. In this illustration, a series of subsequent pixels 
following pixel PXO, for example PXO through PX49, are to be textured using only 
5 direct texturing operations. The rasterizer provides the appropriate direct texture 
coordinate sets PX1 DIR through PX49 DIR per pixel, which are directed via 
switch SO to the direct coordinate FIFO (dFIFO) as shown in Figure 10E. Once 
the texture address processor has computed a new/modified texture address 
coordinate set for pixel PXO. it accepts direct texture coordinate set PX1 DIR (see 
10 Figure 10F). 

J As also shown by Figure 10F, suppose the rasterizer next provides a pixel, 

n j PX50, which follows pixel PX49, is to be textured using an indirect texturing 
^ operation. As illustrated by Figure 10G, switch SO provides the incoming indirect 
^ texture coordinate set, PX50 IND, for pixel PX50 directly to the texture retrieval 
* 15 unit 7012. Giving the indirect texture coordinates priority generally assures that 
the resulting indirect data from an indirect texture mapping will be present in 
buffer 7004 by the time it is needed (thus preventing pipeline stalling and wasted 
cycles). Note, however, that in this example, the indirect texture coordinates for a 
much later pixel in the rendering sequence (e.g., PX50) are being processed by 
20 texture retrieval unit 7012 before the retrieved unit processes an earlier pixel in the 
sequence (e.g., PX2). Such dynamic interleaving exhibited by the example texture 
processing pipeline has advantages in terms of efficiency. 

As shown in Figure 10G, the incoming direct texture coordinate set, PX50 
DIR, for pixel PX50 is provided via SO to direct coordinate FIFO (dFIFO) for 
25 buffering (as also shown in Figure 10G, a texture color, PX1 TEX Color, for pixel 
PX1 corresponding to direct coordinate set PX1 DIR is output at this point by the 



Hi 



29 



texture retrieval unit from the texture lookup and is directed via switch S2 to the 
pixel shader). 

Next as shown in Figure 10H, after the texture retrieval unit performs the 
texture lookup based on indirect coordinate set PX50 IND, the indirect texture 
5 lookup data. PX50, retrieved by the texture retrieval unit is recirculated via switch 
S2 to indirect texture data FIFO (iFIFO). Figure 10H also shows that the next 
direct coordinate set, PX2 DIR, in the stream from dFIFO is processed by the 
texture address processor and provided via switch SI to the texture retrieval unit. 

Figure 101 shows a texture color. PX2 TEX Color, for pixel PX2 
_ 10 corresponding to direct coordinate set PX2 DIR being outputted from the texture 
C= retrieval unit and directed to the pixel shader via switch S2. In the same 

ni processing stage, the next direct texture coordinate set, PX3 DIR, from dFIFO is 

iy 

y processed by the texture address processor and provided via switch SI to the 
m texture retrieval unit and indirect texture lookup data, PX50, is saved in iFIFO 
It, 15 awaiting the propagation of the corresponding PX50 direct coordinate through 
Sj buffer 7006. 

O As illustrated by the example in Figure 10J, the direct coordinate stream, 

PX5 DIR through PX49 DIR, in the dFIFO are processed in turn by the texture 
address unit and provided via switch SI to the texture retrieval unit 7012. Each of 

20 the retrieved texture colors corresponding to direct coordinate sets PX5 DIR 

through PX49 DIR in the stream are then provided in turn to the pixel shader via 
switch S2. Indirect texture lookup data, PX50 (which is being held in iFIFO 7004 
until it can be matched up with the corresponding direct coordinates, PX50 DIR, 
for pixel PX50 in the dFIFO) is finally ready to be released (as shown in Figure 

25 10K), once all the intervening direct coordinate sets, PX5 DIR through PX49 DIR, 
have been processed. The indirect texture data PX50 stored in the iFIFO can then 
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be paired with its corresponding direct coordinate set PX50 DIR and provided to 
the texture address processor for computing a new/modified texture coordinate set 
for pixel 50. 

Example More Detailed Implementation 

5 Figure 1 1 is a flow chart showing an example set of basic processing steps 

used to perform indirect texture mapping for an example implementation. Most of 
the steps of Figure 1 1 are set up by a general indirect texturing API (application 
program interface) function 801 that sets parameters for processing an indirect 
texture by texture unit 500 and texture environment unit 600. 

ri 10 System 50 first stores a texture image/data in texture memory 502 for use as 

J: an indirect texture (block 800). Based on one or more API command functions 
|*( (blocks 802-810), commander processor 200 then provides a specified set of 
^ indirect texture coordinates to texture retrieval unit 500a (see Figure 5) which 
HJ accesses texture memory 504 and retrieves indirect texture lookup data (Figure 7B 
hi 5 block 952). In the example embodiment, one or more API indirect texture 
ry function(s) 801 allow a graphics application to set up associations between texture 
y maps and texture coordinates and to specify which sets of texture maps and 

coordinates are to be used when performing indirect and direct texture referencing 
operations. For example, in a basic indirect texturing operation, one or more sets 
20 of indirect-texture coordinates are specified (block 802), one or more texture maps 
are identified (block 804), parameters for computing new/modified texture 
coordinates and the processing order and number of indirect-texture references are 
specified (block 806), one or more texture maps are identified as indirect textures 
(block 808) and a set of indirect-texture coordinates is associated with an indirect- 
25 texture map (block 810). 
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The data retrieved from the indirect-texture lookup operation is 
"recirculated" back to the same texture address (coordinate) bump/processing 
circuitry 500b/500c via feedback connection 500d for further processing. Texture 
bump/processing circuitry 500b/500c then use the retrieved indirect-texture lookup 

5 data as coordinates offset factors in computing new texture coordinates based upon 
a current regular (non-indirect) texture coordinate and/or pre-defined texture 
scaling, bias and rotation data (block 812). The new/modified coordinates are then 
used as regular direct (non-indirect) coordinates for mapping a texture to a polygon 
(block 814; Figure 7B block 950). Alternatively, these new/modified coordinates 

10 can be re-used again for additional/subsequent coordinate operations via a further 
recirculation of the texture mapping hardware in a yet further texture mapping 
operation. In this manner, multiple levels of indirection can be staged and 
processed using a single texture coordinate bump/processing circuit 500b/500c and 
a single texture retrieval unit 500a. Ultimately, retrieved texture lookup data is 

15 provided to texture environment unit 600 for staged color blending, for example, 
with other textures. 

Example Coordination of Texture Operations With Shader Operations 

In an example implementation of system 50, the indirect and direct texturing 
operations described above are coordinated with corresponding stages of a 
20 recirculating shader within texture environment unit 600. See commonly assigned 

copending application Serial No. "Recirculating Shade Tree 

Blender For A Graphics System" (atty. dkt. 723-851). 

Figure 12 is a functional diagram illustrating an example of how to set up 
regular texture mapping to coordinate with shader stages. Texture lookup 
25 parameters specified, for example, by an API function(s), identify stored texture 
maps associated with sets of regular (non-indirect) texture coordinates. Texture 
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lookups are performed using sets of texture coordinates to access the identified 
texture map(s) stored in memory. The retrieved texture lookup data is then passed 
to TEV unit 600 for color blending. 

In an example embodiment of the present invention, TEV unit 600 allows 
5 programmable color data blending operations for accomplishing polygon texturing 
and shading during discrete processing stages. These stages are pre-defined by an 
appropriate API command function. In the example embodiment, up to sixteen 
TEV processing stages can be pre-defined. Each stage is assigned a processing 
order ID (number) and processed in sequence. In this example, selected TEV 
10 processing stages 910 are associated with a set of texture lookup parameters 912 
J specifying a regular texture lookup operation using a texture coordinate ID 914 and 
f a an associated texture map ID 916. The appropriate texture is looked up using the 
ifi associated coordinates and the retrieved texture data is provided for the 
}Ji corresponding TEV stage blending. While Figure 8 reflects that the example 
"15 embodiment provides up to eight textures and up to sixteen TEV stages, any 
£ number can be used in alternate implementations. 

q The list of texture coordinate/texture map pairs are processed by 

recirculating texture unit 500 and texture environment unit 600 in an order 
specified by a GXSetTevOrder command using a number of recirculating stages as 

20 set by the GXSetNumTev stages command. In the particular example shown in 
Figure 8, a set of texture coordinates designated by an API command, 
GXJTEXCOORD IDs no.7 are used to perform a direct texture mapping using a 
texture map no.3 designated by a GXJTEXMAP IDs API command. The results 
of that particular texture lookup in this example are passed to the texture 

25 environment unit 600 for processing in a TEV stage zero, as designated by a 
GEXSetTevOrder API command. In this example, another set of texture 
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coordinates designated no. 2 are used in a further texture mapping process using a 
texture map no. 4 in a further texture mapping stage (e.g., stage no. 3), and the 
results of this texture lookup are further processed by the texture environment unit 
600 in a TEV processing stage no. 3. Figure 12 shows a further example texture 
5 lookup stage no. 5 using a set of texture coordinates identified by identification no. 
3 and a texture map identified by identification no. 4 in a fifth texture lookup stage, 
and the results of this texture lookup are processed by TEV stage no. 5. Similarly, 
a texture coordinate set no. 5 is used to lookup a further texture map no. 6 and a 
seventh texture lookup and associated TEV processing stage; and a set of texture 
10 coordinates no. 6 are used to lookup a texture map no. 7 in a ninth texture lookup 
0 stage and the results of this texture lookup are processed using a ninth TEV 
N processing stage. The GXSetTevOrder API command specifies the order in which 
fU the various texture coordinate/texture map identification pairs are processed by the 
|B texture unit 500 and the texture environment unit 600. The particular order and the 
; 15 particular IDs shown in Figure 12 are by way of example only. 

^ Figure 13 is a functional diagram illustrating examples of how to set up a 

W sequence of regular and indirect texture processing to coordinate with recirculating 

n 

O shader stages. In this example, selected TEV processing stages are associated with 
a set of texture lookup parameters 911 specifying both regular and indirect texture 
20 lookup operations. For indirect mapping a set of indirect lookup parameters 913 
associate an indirect-texture coordinate ID 904 with a corresponding texture map 
ID 916. In this case, the texture coordinate is specified as an "indirect" coordinate. 
An indirect-texture lookup is performed and the indirect texture lookup results are 
combined with a regular texture coordinate to obtain a new/modified texture 
25 coordinate. The new/modified texture coordinate is then used to access an 
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associated texture map 916 and the resulting texture lookup data is provided for the 
corresponding TEV stage. 

In the Figure 13 example, additional API commands GXSetlndTexOrder, 
GXSetNumlndStages and GXSetTevIndirect are used to invoke the indirect texture 
5 lookup operations. In the particular example shown, texture coordinate/texture 
map pair no. 0 are used to perform an indirect texture lookup and the result of that 
indirect texture lookup is combined by operator 909 (see Figure 7B) with the set of 
texture coordinates designated no. 7 to perform a direct texture lookup using 
texture map designated no. 3 in a recirculating processing stage 0. In a similar 
10 manner, a texture coordinate/texture map pair no. 1 are used to perform an indirect 
A texture lookup to yield retrieved data that operator 909 combines with a texture 

ry coordinate set no. 2 to perform texture mapping on a corresponding direct texture 

?=■ ; * 

y map no. 4. In the particular Figure 13 example shown, a further indirect texture 

lookup using a texture map no. 2 and a set of texture coordinates no. 4 provide 
U 15 indirect texture lookup results that operator 909 uses for two successive direct 
^ texture lookups using a texture coordinate no. 3/texture map no. 5 pair (in TEV 
^ processing stage no. 5) and a set of texture coordinates no. 5/texture map no. 6 pair 
^ in a seventh TEV processing stage. The particular texture coordinate ID numbers, 
texture map ID numbers and TEV processing stage numbers shown in Figure 13 
20 are by way of example only. 

Example API Indirect Texture Function Commands 

As shown in Figures 12-13, one or more graphics API functions are used to 
set up and initiate direct and indirect texturing operations. Example API functions 
for setting up indirect texture operations and parameters may be defined as 
25 follows: 
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GXSetlndTexOrder 

This function is used to specify the texture coordinate and texture map to be 
used with a given indirect lookup. 

Arguments: 

ind_stage The indirect stage that is being affected. 

tex_coord The texcoord to be used for this stage. A given texcoord can be 

shared by an indirect and regular stage at the same time. 

tex_map The texture map to be used for this stage. A given texture map 

can only be indirect or regular, not both. 

5 

In more detail, example arguments are: 

u8 Numlndtex; II Number of indirect textures. 

GXIndTexStagelD IndexTesIdf ]; II Indirect texture stages being affected. 

GXTexMapID Texjnapl ]; II Indirect texture map (ID) to be used for 

this stage. 

GXTexCoordID Tex_coord[ ] II Associated texture coordinate for each 

indirect texture map. 

The above function associates a specified texture map and a texture 
coordinate with an indirect texture map ID name. It is used to specify a texture 
coordinate and a texture map to use with a given indirect lookup. In one example 
10 embodiment, a specified texture map is used as either an indirect or a direct 
texture, but not both, although alternative arrangements are possible. 

Example Usage : 

void GXSetIndTexOrder(GXIndTexStageID ind^stage, 

GXTexCoordID tex_coord, 
1 5 GXTexMapID tex_map); 



GXSetTevIndirect (GXSetlndirectTexture) 
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This is the general-purpose function used to control how the results from an 
indirect lookup will be used to modify a given regular TEV stage lookup. 

Arguments: 



matrix_sel Indicates which indirect matrix and scale value to multiply the 
offsets with. 

wrap__s Indicates the wrapping factor to use with the S component of the 

regular texture coordinate. 

wrap_t Indicates the wrapping factor to use with the T component of the 

regular texture coordinate. 

add_prev Indicates whether the texture coordinate results from the previous 
TEV stage should be added in. 

utc Jod Indicates whether to use the unmodified (GXJTRUE) or modified 

(GXJFALSE) texture coordinates for mipmap LOD computation. 

alpha_sel Indicates which offset component will supply the indirect "bump" 
alpha, if any. 

In more detail, example arguments are: 



GXTevStagelD 


TevStageld; 


II 


TEV color combining stage ID name. 


GXIndTexStagelD 


indStage; 


II 


indirect tex stage used with this TEV stage. 


GXIndTexFormat 


Fmt; 


II 


format of indirect texture offsets. 


GXIndTexBiasSel 


Bias; 


II 


Selects which offsets (S, T) receive a bias. 


GXIndTexATphaSel 


AlphaSel; 


II 


Selects indirect texture alpha output. 


GXIndMtxID 


MatrixSel; 


II 


Selects which texture offset matrix and scale. 


GXIndTexWrap 


WrapS; 


II 


Wrap value of direct S coordinate. 



tev_stage 
ind_stage 
format 



bias sel 



The TEV stage that is being affected. 

The indirect stage results to use with this TEV stage. 

Indicates how many bits to extract from the indirect result color to 
use in indirect offsets and the indirect "bump" alpha. 

Indicates whether or not a bias should be applied to each 
component of the indirect offset. 
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GXIndTexWrap 
GXBool 



WrapT; 
IndLOD; 



II Wrap value of direct T coordinate 

// Use modified texture coordinates for level of 
detail (LOD). 

// Add output from previous stage to texture 
coordinates. 



GXBool 



AddPrev; 



The above function allows setting all of the various parameters for 
processing a given indirect texture associated with a particular TEV stage. The 
function associates an indirect texture map with a TEV color combining stage, 
specifies how the retrieved indirect-texture lookup data (color values) will be 
converted to texture coordinate offsets (i.e.. 3. 4. 5 or 8 bit format), selects texture 
offset matrix and texture scaling values, specifies texture-coordinate wrap 
parameters and whether the computed new/modified coordinates should be used 
for level of detail (LOD) with mip-mapped textures. The function also allows 
selecting whether the computed output from the texture processing logic 512 (see 
below) during a previous stage is added to text coordinate in a current stage. 

Example Usage : 

void GXSetTevIndirect (GXTevStagelD tev_stage, 

GXIndTexStagelD ind_stage, 
GXIndTexFormat format, 
GXIndTexBiasSel bias_sel, 
GXIndTexMtxID matrix_sel, 
GXIndTexWrap wrap_s, 
GXIndTexWrap wrap_t 
GXBool add_prev, 
GXBool utcjod, 
GXIndTexAlphaSel alpha_sel); 
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GXSetlndTexMtx 

This function lets one set one of the three static indirect matrices and the 
associated scale factor. The indirect matrix and scale is used to process the results 
of an indirect lookup in order to produce offsets to use during a regular lookup. 
The matrix is multiplied by the [S T U] offsets that have been extracted (and 
optionally biased) from the indirect lookup color. In this matrix- vector multiply, 
the matrix is on the left and the [S T U] column vector is on the right. 

The matrix values are stored in the hardware as a sign and 10 fractional bits 
(two's complement). Thus the smallest number that can be stored is -1 and the 
largest is (1 - 1/1024) or approximately 0.999. Since -hi cannot be stored, you may 
consider dividing all the matrix values by 2 (thus +1 becomes +0.5) and adding one 
to the scale value in order to compensate. 

In more detail, example arguments are: 

GXIndTexMtsID Mtxld; II Texture offset matrix name. 

F32 Offset Matrix [ 3 ][2 ]; II Matrix elements for the texture offset 

matrix. 

GXIndTexFormat Fmt; II Exponent value for scaling 

(scale = 2 ScaleExp ). 

The above example API function sets matrix M and scale values in lookup 
data processing logic (proc) 512. The retrieved indirect texture lookup data (e.g. 
texture coordinate offsets s, t, u) is multiplied by Offset Matrix 525 (M) and the 
scaling factor 526. The OffsetMatrix is an API function parameter specifying the 
3x2 element matrix elements used within indirect processing logic 512 (see 
below). In a preferred embodiment, the matrix elements are within the range 
(-1, 1). ScaleExp is a parameter specifying power-of-two exponent used for setting 
the scale factor. The preferred range of ScaleExp is (-32, 32). 
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Example usage : 

void GXSetIndTexMtx( GXIndTexMtxID mtx_sel 

f32 offset.mtx[2][3], 
s8 scale_exp) 

GXSetlndTevOrder 

The above function associates a regular non-indirect texture map and a 
texture coordinate with an indirect texture map ID name. 

GXSetTevStagelD stage 
GXSetTexCoordID coord 
GXChannellD color 

Example usage: 

void GXSetlndTexOrder (GXIndTevStagelD tev__stage ? 

GXTexCoordID cood. 
GXChannellD color) ; 

GXSetNumlndStages 

This function is used to set how many indirect lookups will take place. The 
results from these indirect lookups may then be used to alter the lookups for any 
number of regular TEV stages. 

GXSetNumlndStages u8 stages 

The above function sets the number of indirect texture lookup stages. 

Example usage : 

void GXSetNumIndStages( u8 nstages ); 



GXSetNumTevStaees 

This function enables a consecutive number of Texture Environment (TEV) 
stages. The output pixel color (before fogging and blending) is the result from the 
last stage. The last TEV stage must write to register GXJTEVPREV, see 
5 GXSetTevColorOp and GXSetTevAlphaOp. At least one TEV stage must be 
enabled. If a Z-texture is enabled, the Z texture must be looked up on the last 
stage, see GXSetZTexture. 

The association of lighting colors, texture coordinates, and texture maps 
with a TEV stage is set using GXSetTevOrder. The number of texture coordinates 
10 available is set using GXSetNumTexGens. The number of color channels 
available is set using GXSetNumChans. 

GXInit will set nStages to 1 . 

Arguments : 

nStages Number of active TEV stages. Minimum value is 1, maximum value is 16. 

15 In more detail: 

GXSetNumTevStages u8 stages 

The above function sets the number of TEV color blending stages. This 
function sets parameters associated with the amount of recirculation being 
performed by texture unit 500 and texture environment unit 600, as well as the 
sequence the recirculating stages are performed in. 

20 Example usage : 

void GXSetNumTevStages( u8 nStages ); 
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GXSetlndCoordTexScale 

This function is used when one wishes to share a texcoord between an 
indirect stage and a regular TEV stage. It allows one to scale down the texture 
coordinates for use with an indirect map that is smaller than the corresponding 
regular map. 

Arguments : 

ind_stage The indirect stage that is being affected, 
scale_s The scale factor for the S coordinate, 

scale J; The scale factor for the T coordinate. 

In more detail, example arguments are: 

GXIndTexMapID IndTexId; II Indirect texture name. 

GXIndTexScale Scale S; II Scale value for S coordinate. 

GXIndTexScale ScaleT; II Scale value for T coordinate. 

The above function sets a value for scaling the indirect texture coordinates. 
The texture coordinates are scaled after a perspective divide and before addition to 
the regular non-direct texture coordinates. 

Example Usage: 

void GXSetlndTexCoordScale (GXIndTexStagelD ind^stage, 

GXIndTexScale scalers, 
GXIndTexScale scale_t); 
This function is used when one wishes to use the same texture coordinates 
for one TEV stage as were computed in the previous stage. This is only useful 
when the previous stage texture coordinates took more than one stage to compute, 
as is the same for GXSetTevIndBumpST. 
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Example Arguments: 

tev_stage The TEV stage that is being changed. 
Example Usage: 

void GXSetTevIndRepeat( GXTevStagelD tev_stage ); 
GXSetTevIndBumpST 

This function sets up an environment-mapped bump-mapped indirect 
lookup. The indirect map specifies offsets in (S, T) space. This kind of lookup 
requires 3 TEV stages to compute. The first two TEV stages should disable texture 
lookup. The third stage is where the lookup is actually performed. One may use 
GXSetTevIndRepeat in subsequent TEV stages to reuse the computed texture 
coordinates for additional lookups. The surface geometry must provide 
normal/binormal/tangents at each vertex. 

Example Arguments : 

tev_stage The TEV stage that is being affected. 

ind_stage The indirect stage results to use with this TEV stage. 

matrix_sel Indicates which indirect scale value to multiply the offsets with. 

Example Usage : 

void GXSetTevIndBumpST (GXTevStagelD tev__stage, 

GXIndTexStagelD ind_stage, 
GXIndTexMtxID matrix_sel ); 

GXSetTevIndBumpXYZ 

This function sets up an environment-mapped bump-mapped indirect 
lookup. The indirect map specifies offsets in object (X, Y, Z) space. This kind of 
lookup requires only one TEV stages to compute. The indirect matrix must be 
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loaded with a transformation for normals from object space to eye space. The 
surface geometry need only provide regular normals at each vertex. 

Example Arguments : 

tev_stage The TEV stage that is being affected. 

ind_stage The indirect stage results to use with this TEV stage. 

matrix_sel Indicates which indirect matrix and scale value to multiply the offsets with. 

Example Usage: 

5 void GXSetTevIndBumpXYZ( GXTevStagelD tev_stage, 

GXIndTexStagelD ind_stage, 

0 GXIndTexMtxID matrix_sel ); 

fi GXSetTevDirect 

ry This function is used to turn off all indirect offsetting for the specified 

: ~ : 

2 10 regular TEV stage. 

: Example Arguments : 

Q tev_stage The TEV stage that is being changed. 

□ Example Usage : 

void GXSetTevDirect( GXTevStagelD tev_stage ); 

GXSetTevIndWarp 

15 This function allows an indirect map to warp or distort the texture 

coordinates used with a regular TEV stage lookup. The indirect map should have 
8-bit offsets, which may be signed or unsigned. "Signed" actually means "biased," 
and thus if signed_pffsets is GXJTRUE, 128 is subtracted from the values looked 
up from the indirect map. The indirect results can either modify or completely 
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replace the regular texture coordinates. One may use the indirect matrix and scale 
to modify the indirect offsets. 

Arguments : 

tev_stage The TEV stage that is being affected. 

ind_stage The indirect stage results to use with this TEV stage. 

signed_offsets Indicates whether the 8-bit offsets should be signed/biased 
(GX_TRUE) or unsigned (GX_FALSE). 

replace_mode Indicates whether the offsets should replace (GX_TRUE) or be added 
to (GX_FALSE) the regular texture coordinates. 

matrix__sel Indicates which indirect matrix and scale value to multiply the offsets 
with. 

2 Example Usage : 

W 5 void GXSetTevIndWarp( GXTevStagelD tev_stage, 

UJ GXIndTexStagelD ind_stage. 

RJ GXBool signed_offsets ? 

U GXBool replace_mode, 

fU GXIndTexMtxID matrix_sel); 

0 10 GXSetTeylndTile 

This function may be used to implemented tiled texturing using indirect 
textures. Note that the regular texture map only specifies tile definitions. The 
actual number of texels to be applied to the polygon is a function of the base tile 
size and the size of the indirect map. In order to set the proper texture coordinate 
15 scale, one must call GXSetTexCoordScaleManually. One can also use 

GXSetlndTexScale in order to use the same texcoord for the indirect stage as the 
regular TEV stage. 

Example Arguments : 
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tev_stage The TEV stage that is being affected. 

ind_stage The indirect stage results to use with this TEV stage. 

tilesize_s Indicates the size of the tile in the S dimension. 

tilesize_t Indicates the size of the tile in the T dimension. 

Tilespacing_s Indicates the spacing of the tiles in the S dimension. 

Tilespacing_t Indicates the spacing of the tiles in the T dimension. 

Format Indicates which indirect texture format to use. 

matrix_sel Indicates which indirect matrix and scale value to multiply the offsets 
with. 

bias_sel Indicates the tile stacking direction for pseudo-3D textures. 

alpha„sel Indicates which offset component will supply the indirect "bump" 

alpha, if any (for pseudo-3D textures). 

Example Usage : 

void GXSetTevIndTile( GXTevStagelD tev^stage, 

GXIndTexStagelD ind__stage, 
ul6 tilesize_s, 

5 ul6 tilesize_t. 

u!6 tilespacing_s, 
ul6 tilespacingj;, 
GXIndTexFormat format, 
GXIndTexMtxID rnatrix_sel, 
1 0 GXIndTexBiasSel bias_scl, 

GXIndTexAlphaSel alpha_sel); 

GXSetTevIndRepeat 

This function is used when one wishes to use the same texture coordinates 
for one TEV stage as were computed in the previous stage. This is useful when 
15 texture coordinates require more than one processing cycle/stage to compute. 
Example Arguments : 
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Tev_stage The TEV stage that is being changed. 
Example Usage : 

void GXSetTevIndRepeat ( GXTevStagelD tev_stage ) ; 

GXSetAlphaCompare 

This functino sets the parameters for the alpha compare function which uses 
the alpha output from the last active Texture Environment (TEVk) stage. The 
number of active TEV stages are specified using GXSetTevStages. 

The output alpha can be used in the blending equation (see 
GXSetBlendMode) to control how source and destination (frame buffer) pixels are 
combined. 

The alpha compare operation is: 

alpha_pass = (alpha_src (compO) (refO) (op) (alpha_src (comp) refl) 

where alpha_src is the alpha from the last active Tev stage. As an example, 
you can implement these equations: 

alpha_pass = (alpha_src > refO) AND (alpha_src < refl) 

or 

alpha_pass = (alpha_src > refO) OR (alpha_src < refl) 

The Z compare may occur either before or after texturing. In the case where 
Z compare is performed before texturing, the Z is written based only the Z test. 
The color is written if both the Z test and alpha test pass. 

When Z compare is done after texturing, the color and Z are written if both 
the Z test and alpha test pass. When using texture to make cutout shapes (like 
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billboard trees) that need to be correctly Z buffered, one should perform Z 
buffering after texturing. 

Example Arguments : 

compO Comparison subfunction 0. 

refO Reference value for subfunction 0. 8-bit. 

op Operation for combining subfunctionO and subfunction 1 . 

Accepted values are: GX_AOP_AND, GX_AOP_OR. 
GX_AOP_XOR, GX_AOP_XNOR. 

compl Comparison subfunction 1. 

refl Reference value for subfunction L 8-bit. 

Example Usage : 

void GXSetAlphaCompare ( 

GXCompare cornpO, 

u8 refO, 

GXAlphaOp op, 

GXCompare compl, 

u8 refl ): 

Example Hardware Implementation: 

In one preferred example embodiment, texture unit 500 and texture 
environment unit 600 have been implemented in hardware on a graphics chip, and 
have been designed to provide efficient recirculation of texture mapping operations 
as described above. In more detail, the texture address coordinate/bump 
processing block 500b/500c is implemented in hardware to provide a set of 
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appropriate inputs to texture mapping block 500a and texture environment block 
600a. Blocks 500b, 500c in conjunction with sequencing logic use to recirculate 
blocks 500a, 600a present a sequence of appropriate inputs at appropriate times 
with respect to various recirculating stages to efficiently reuse blocks 500a, 600a - 
5 in some cases creating a feedback loop via path 500d wherein the output of block 
500a is modified and reapplied to its input in a later sequential recirculating 
processing stage. This results in a logical sequence of distinct texture processing 
stages that, in the preferred embodiment, are implemented through 
reuse/recirculation of the same hardware circuits over and over again. The 
10 resulting functionality provides any desired number of logical texture mapping 
O processing stages without requiring additional hardware. Providing additional 
hardware for each of the various texture processing stages would increase speed 

ry 

FU performance but at the penalty of additional chip real estate and associated 
00 complexity. Using the techniques disclosed herein, any number of logical texture 
s 15 mapping stages can be provided using a single set of texture mapping hardware. 
y* Of course, in other implementations to improve speed performance, it would be 
m possible to replicate the texture mapping hardware so that multiple texture 
q mapping stages could be performed in parallel rather than in seriatim as shown in 
Figure 12 and 13. Even in such alternative implementations providing multiple 
20 sets of the same or different texture mapping hardware, providing the recirculating 
and indirect texture mapping techniques disclosed herein would be quite valuable 
in expanding functionality and flexibility of more generic hardware to provide a 
particular sequence of possibly involved and complicated texture mapping 
operations or stages specified by an application programmer to achieve particular 
25 advanced texture mapping effects. 
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Figures 14 and 15 show one example implementation of particular hardware 
used to collect and present various parameters to texture unit 500 for logical direct 
and/or indirect texture mapping lookup. Figure 14 shows a block diagram of an 
example hardware implementation of texture coordinate/bump processing 
5 hardware 500b/500c, and Figure 15 shows an example processing and computation 
logic within the Figure 14 example implementation. In the preferred 
implementation, the particular functions performed by the Figure 14/Figure 15 
hardware are controlled by control data passes down the graphics pipeline to the 
hardware control logic registers 503 within coordinate/bump processing unit 
10 500b/500c. Logical functions and computational operations occurring within each 

P unit in graphics pipeline 180 are determined by control codes (bits) provided by 

command processor 200 in register data packets that are distributed throughout the 

fy graphics pipeline. The appropriate control codes/values are placed in control logic 

ffl registers within each unit for controlling that input during one or more pipeline 

u 15 clocking cycles. 

^ Referring to the Figure 14 high level block diagram of an example hardware 

® implementation of texture coordinate processing/bump block 500b/500c. Logical 
O direct and indirect texture coordinates and hardware control register data are 

passed to texture coordinate processing/bump unit 500b/500c from rasterizer 400 
20 (see Figure 5) over graphics pipeline 180 data bus lines 501 (xym). Graphics 
pipeline global command data on pipeline command bus 505 (cmd) specifies 
whether incoming texture coordinates are "direct" or "indirect". Direct texture 
coordinates (i.e., regular non-indirect texture coordinates) and indirect texture 
coordinates are provided from a rasterizer via pipeline data bus lines 507 (st) and 
25 are processed together in a substantially continuous interleaved fashion so as to 

maintain a fine granularity of processing throughout the graphics pipeline. Indirect 
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texture lookup data is retrieved from texture unit 500a and recirculated" back to 
coordinate processing/bump unit 500b/500c via texture color/data bus 518 (col) 
corresponding to data feedback path 500d (Figure 5). 

Control register data received from command processor 200 is stored in 
5 registers 503 for controlling indirect texturing operations. Some of the stored 
control register data is utilized, for example, for selecting and controlling various 
computational operations that take place within coordinateAookup data processing 
logic 512 (proc) (as indicated, for example, by register data lines 520 in Figure 15). 
A command decoder and synchronizing circuit, syncO (502). determines whether 
10 incoming data on lines 501 (xym) are a set of direct texture coordinates, indirect- 
texture coordinates, or control logic register packets. Incoming direct coordinates 
are routed to a FIFO unit 506 {dfifo) for further processing by data synchronizing 
circuit 510 (syncl) and processing logic unit (proc) 512. Incoming indirect 
coordinates are routed directly to an output data synchronizing circuit 504 (syncl) 
j\. 15 for passing on to texture unit 500a (Figure 5). Synchronizing circuits syncO (502), 
l! syncl (504) and syncl (510) perform data routing and synchronization to control 
JjJ the timing and handling of indirect and direct texture coordinate data from the 
O rasterizer (400) and retrieved indirect texture lookup data from the texture unit. 
Effectively, synchronizing circuits syncO (502), syncl (504) and syncl (510) 
20 perform the respective functions of switches SO, SI and S2 in Figure 8. 

Incoming "recycled" indirect texture lookup data received via texture 
color/data feedback bus 518 (col) from texture unit 500a is placed in FIFO unit 508 
(ififo)- Direct texture coordinates are aligned at the st output of FIFO unit 506 
(dfifo) with the incoming indirect texture lookup data at the col output 5 19 of FIFO 
25 unit 506 (dfifo). Synchronizing circuit 510 (syncl) performs further coordinate 
data alignment and assembles a complete set of operands to provide to processing 
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unit 512 (proc) for indirect texture processing operations based on the control logic 
codes stored in registers 503. These operands include, for example, multiplication 
coefficients/constants for the texture offset matrix elements and lookup data 
formatting parameters for performing texture coordinate computations within 
5 processing unit 512 (proc). After coordinate data and retrieved indirect-texture 
lookup data is processed by proc unit 512, the resulting data (e.g., new/modified 
texture coordinates) is passed to synchronizing circuit 504 (synel), where the data 
is interleaved with a stream of indirect texture coordinates from synchronization 
unit 502 circuit (syncO) and provided to texture retrieval unit 500a. 

10 Referring now to the Figure 15 example of the processing and computational 

logic within indirect-texture lookup data/coordinate processing unit 512 (proc), 
retrieved indirect-texture lookup data is in the form of data triplets having three 
data components (also referred to as s. t and u texture offsets). The number of data 
bits per component will be dependent in part upon the manner and purpose for 

15 which the particular indirect-texture map is used in an indirect texturing operation. 
Indirect-texture lookup data may also consist of "bump alpha data" used elsewhere 
in the graphics pipeline for transparency shading/blending. Retrieved indirect- 
texture lookup data (e.g., s, t and u offset data) on col bus 519 is first passed 
through Format selection block 521 which selects whether retrieved indirect 

20 texture lookup data is "bump alpha" data or is to be processed as multi-bit binary 
data triplets of three, four, five, or eight bits. The Format selection block provides 
offset data triplets to bias unit 523 and "bump alpha" data is routed to bump alpha 
select multiplexer 532 for use in transparency blending elsewhere in the graphics 
pipeline. 

25 In a preferred example embodiment of the present invention, the Format 

unit logic extracts three texture offset data components of 8, 5, 4, or 3 -bits from a 
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24-bit data triplet on the col input bus 519 and extracts 5 -bit bump-alpha select 
values (bs L bL and bu)fox possible output on the xym bus. Bypass multiplexer 532 
is provided to allow selection of one bump-alpha value, bs, bt, or bu* to be output 
on the pipeline xym bus. An optional bias value may be applied to the data triplets 
5 by bias unit 523. For example, if eight-bit data triplet components were selected, 
then a bias of -128 could be applied by bias unit 523 to allow for signed offsets. 
(If data triplet components of less than eight bits are used, then a bias of +L for 
example, is applied). 

A matrix select multiplexer 524 allows loading selected direct coordinates or 
10 constants for performing a matrix multiplication operation 525. In addition, a 

■? 5 

J modulo wrap unit 527 is provided to optionally perform coordinate wrap 

ry operations on an associated regular direct texture coordinate. For example, using 

an API function, one may specify a wrap value of 0. 16, 32, 64, 128, or 256. 

rU A matrix multiplication operation 525 is performed on a data triplet using 

iff 

U 15 matrix elements M. For example, the data triplet is loaded into a three-element 
m vector data register V associated with matrix multiplication operation 525 and then 
p multiplied by matrix elements M (Figure 12A). Matrix multiplication operation 
525 may be used, for example, for rotation, scaling, and re-mapping of s, t and u 
texture offset triplets retrieved from an indirect-texture via col bus 519. Values of 
20 matrix elements M are variable and may be dynamically defined for each texture 
processing cycle/stage using selected matrix configurations. In a preferred 
embodiment, the multiplication matrix is a 3 by 2 element matrix having a 
programmable configuration selected either from one of three constant matrices 
comprising elements defined from selected logic control register data 520 or, 
25 alternatively, selected from one of two "variable" matrices having elements derived 
from the current direct texture coordinates obtained via the pipeline st coordinate 
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data bus 522. Using an appropriate API function, one may predefine up to three 
different static matrices or two different variable matrices and select which matrix 
is to be used for a given indirect texture operation. 

Figure 16 A illustrates an example texture offset matrix multiplication 
operation using matrix programmable static constant elements ma, pb, mc, md, 
me and mf . Retrieved indirect-texture lookup data comprising data triplet 
components s, t and u provide texture offset values are used as a multiplicand 
column vector matrix for the operation. The resultant product values are a pair of 
new/modified texture coordinates s T and t f . In a preferred arrangement, elements 
of the matrix are represented by multi-bit binary floating point values within a 
range of -1 to +1. Figure 16B illustrates two example "variable" matrices, Matrix 
A and Matrix B, having elements derived from current direct texture coordinates 
(s, t) obtained via the pipeline st coordinate data bus 522. 

Referring once again to Figure 15, a scaling logic 526 is provided for 
performing optional scaling operations. Scale unit 526 scales the results of the 
matrix multiplication. The amount of scale is a power of 2 specified by the user. 
For example, using an API function, one may choose a scale value by specifying 
an exponent of 2 in the range of -32 to +3 1 . This scale value can be used, for 
example, to stretch texture offsets over the size of a regular texture map that is 
associated with the indirect-texturing operation. 

Wrap logic 527 optionally applies a (modulo) wrap to the direct texture 
coordinates before the final add. The wrap size is a programmable power of 2 
specified, for example, by an API function through the control logic registers. 

Once the above processing operations have taken place, the computed 
offsets are added to the current direct texture coordinates using adder 528. The 
result becomes the new/modified texture coordinate that is used for further direct 
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or indirect texture lookup. Stage output re-circulation buffer 530 is provided to 
allow optionally adding the computation results from a previous processing stage 
may be optionally added. The resulting computed new/modified coordinates are 
passed to the texture retrieval unit 500a. 

Example Hardware Control Register Formats 

Figure 17 shows example logic control register data field formats which may 
be used to define and control parameters and operations within processing unit 
512. For example, certain data fields may be used to pass programmable constants 
ffor use as static matrix elements. Other fields may define data triplet format, 
control bias and scaling factors, indicate the number of indirect operation stages or 
provide other parameters for operations within proc logic unit 512. 

The following table shows non-limiting example control register 
descriptions and formats for controlling operations within indirect-texture/bump 
unit 500b/500c and processing logic 512: 

name: format: description: 

ma i? mb i? SO. 10 Specifies parameters for one of the three texture offset 
mci, md i? matrices. All recycled texture values are passed through 

mei, mfi the matrix 

Si S5 This field specifies post-scale for the matrix. The scale is 

done by performing a shift on the matrix outputs by an 
amount equal to (l«n) for positive values, and (l«(-n)) 
for negative values. 

b^ 2 This field one of up to four indirect textures to use during 

direct texture cycle, 

fmti 2 This field specifies how the s, t, and u offsets, along with 

the 5-bit bump alpha select values bs, bt* and bu, are 
selected from the 24-bit recycled texture data (col). 

biaSj 3 This field specifies whether or not to apply bias to the s, t, 

and u coordinates after formatting and before matrix 
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multiplication. The amount of the bias is -128 for the 
FMTJS format, and +1 for the other three formats (see 
"fmf field description above). 

mi 4 This field selects a matrix and scale to use during the 

current direct texture cycle. In addition to the three 
constant matrices defined using maL.mfi (see above), two 
variable matrices are also defined whose values are 
obtained from the current direct texture coordinates 

Matrix A 

(si 256 t!256\ 
0 0 
0 0 

Matrix B 

( o 0 ^ 
5/256 f/256 
0 0 

sw 1? tWj 3 This field implements a wrap on the s and t direct texture 

coordinates prior to adding the texture offset (this is done 
by using a bit-mask). 

fbt 1 This field specifies whether to include computation 

results from a previous computation stage (retained in 
buffer 530) in the add performed by adder 518. 

imask 8 This field specifies, for each of eight sets of texture state, 

whether the textures associated with this state are indirect 
(l)or direct (0). 

In the proc logic unit, for the control registers shown in Figure 13, registers 
MTXi define matrix element data for three matrices (i.e., i = 0,1,2). Registers 
CMDi define the bump command for each of 16 TEV stages (i = 0-15); registers 
IMASK defines the direct or indirect usage of each of up to eight textures. 

5 Mode Changes 

In an example implementation, operational mode changes within the 
pipeline are handled by interleaving a control register address-data pair (which 
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contains, for example, the address of a particular hardware logic control register 
associated with some circuitry within the pipeline and the appropriate control 
data/instruction for controlling that circuitry) with rasterization data output by the 
rasterizer. This control register address-data pair information trickles down the 
5 graphics pipeline with the data and remains interleaved in the correct order with 
the data that it affects. Consequently, most operational mode changes may be 
effected without "flushing" (purging) the pipeline. Although mode changes may 
be complicated somewhat by the fact that there could be multiple paths data within 
the pipeline for control register data to reach its ultimate destination, more efficient 
10 operation may be obtained, for example, by adherence to the following exemplary 
operational constraints: 

1) Hardware control register data affecting the texture address processor 
500b/500c is passed through direct texture coordinate FIFO (dFIFO), 
(e.g., via switch SO); 

15 2) Hardware control register data affecting direct contexts in the texture 

unit 500a is passed through the direct texture coordinate FIFO 
(dFIFO) in the texture address processor to texture unit 500a; 

3) Hardware control register data affecting indirect texturing contexts in 
texture unit 500a are passed directly from the rasterizer 400 to texture 

20 unit 500a (e.g., via switches SO and SI); and 

4) Hardware control register data affecting the pixel shader (TEV) or 
frame buffer 702 are passed through the direct texture coordinate 
FIFO (dFIFO), the texture address processor 500b/500c, and the 
texture unit 500a. 
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In an example implementation of the present invention, the possible 
texturing contexts are defined as either a direct context or an indirect context. 
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Direct contexts may handle only direct texture data, and indirect contexts may 
handle only indirect texture data. A change in the definition of one or more 
contexts between, for example, indirect to direct or direct to indirect operation, 
may require a partial flush of the graphics pipeline. 

5 Example Indirect Texture Processing Results 

As will now be appreciated, the recirculating direct and indirect texture 
processing architecture described above provides an extremely flexible and 
virtually unlimited functionality. An application programmer can invoke any 
number of logical texture mapping stages to provide any desired sequence of any 
f3 10 number of direct or indirect texture mapping operations. This powerful capability 
rl allows the application programmer to create dynamically a number of complex and 
:^ interesting texture mapping visual effects. 

m As one example, indirect textures can be used for texture warping effects. In 

s ~ this example case, the indirect texture is used to stretch or otherwise distort the 
15 surface texture. A dynamic distortion effect can be achieved by swapping indirect 
maps (or by modifying the indirect map or coordinates). One may apply this effect 
S to a given surface within a scene, or one can take this one step further and apply 

the effect to the entire scene. In the latter case, the scene is first rendered normally 
and then copied to a texture map. One then draws a big rectangle that is then 
20 mapped to the screen using an indirect texture- Texture warping can be used to 
produce shimmering effects, special lens effects, and various psychedelic effects. 

As another example, the indirect feature also allows the drawing texture tile 
maps. In this scenario, one texture map holds the base definition for a variety of 
tiles. An indirect texture map is then used to place specific tiles in specific 
25 locations over a 2D surface. With indirect textures, only one polygon needs to be 
drawn. 
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Other Example Compatible Implementations 

Certain of the above-described system components 50 could be 
implemented as other than the home video game console configuration described 
above. For example, one could run graphics application or other software written 
5 for system 50 on a platform with a different configuration that emulates system 50 
or is otherwise compatible with it. If the other platform can successfully emulate, 
simulate and/or provide some or all of the hardware and software resources of 
system 50, then the other platform will be able to successfully execute the 
software. 

10 As one example, an emulator may provide a hardware and/or software 

y3 configuration (platform) that is different from the hardware and/or software 
Ilj configuration (platform) of system 50. The emulator system might include 
yy software and/or hardware components that emulate or simulate some or all of 
m hardware and/or software components of the system for which the application 
^ 15 software was written. For example, the emulator system could comprise a general 
ki purpose digital computer such as a personal computer, which executes a software 
emulator program that simulates the hardware and/or firmware of system 50. 

Some general purpose digital computers (e.g., IBM or Macintosh personal 
computers and compatibles) are now equipped with 3D graphics cards that provide 

20 3D graphics pipelines compliant with DirectX or other standard 3D graphics 
command APIs. They may also be equipped with stereophonic sound cards that 
provide high quality stereophonic sound based on a standard set of sound 
commands. Such multimedia-hardware-equipped personal computers running 
emulator software may have sufficient performance to approximate the graphics 

25 and sound performance of system 50. Emulator software controls the hardware 
resources on the personal computer platform to simulate the processing, 3D 
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graphics, sound, peripheral and other capabilities of the home video game console 
platform for which the game programmer wrote the game software. 

Figure 15 illustrates an example overall emulation process using a host 
platform 1201, an emulator component 1303, and a game software executable 
5 binary image provided on a storage medium 62. Host 1201 may be a general or 
special purpose digital computing device such as, for example, a personal 
computer, a video game console, or any other platform with sufficient computing 
power. Emulator 1303 may be software and/or hardware that runs on host 
platform 1201, and provides a real-time conversion of commands, data and other 
10 information from storage medium 62 into a form that can be processed by host 
J 1201. For example, emulator 1303 fetches "source" binary-image program 
f j instructions intended for execution by system 50 from storage medium 62 and 
^ converts these program instructions to a target format that can be executed or 
jf! otherwise processed by host 1201. 

15 As one example, in the case where the software is written for execution on a 

?ii platform using an IBM PowerPC or other specific processor and the host 1201 is a 
p personal computer using a different (e.g., Intel) processor, emulator 1303 fetches 
^ one or a sequence of binary-image program instructions from storage medium 62 
and converts these program instructions to one or more equivalent Intel binary- 
20 image program instructions. The emulator 1303 also fetches and/or generates 

graphics commands and audio commands intended for processing by the graphics 
and audio processor 1 14, and converts these commands into a format or formats 
that can be processed by hardware and/or software graphics and audio processing 
resources available on host 1201. As one example, emulator 1303 may convert 
25 these commands into commands that can be processed by specific graphics and/or 
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or sound hardware of the host 1201 (e.g.. using standard DirectX, OpenGL and/or 
sound APIs). 

An emulator 1303 used to provide some or all of the features of the video 
game system described above may also be provided with a graphic user interface 
5 (GUI) that simplifies or automates the selection of various options and screen 
modes for games run using the emulator. In one example, such an emulator 1303 
may further include enhanced functionality as compared with the host platform for 
which the software was originally intended. 

In the case where particular graphics support hardware within an emulator 
10 does not include the example indirect texture referencing features and functions 
y3 illustrated by Figures 8 through 12, the emulator designer has a choice of either: 

Jlj • translating the indirect-texture referencing commands into other graphics 

S API commands the graphics support hardware understands, or 

• implementing indirect-texture referencing in software with a potential 
f*15 corresponding decrease in performance depending upon the speed of the 

^ processor, or 

Jf • "stubbing" (i.e., ignoring) the indirect-texture referencing commands to 

provide a rendered image that does not include effects utilizing indirect- 
texture referencing. 

20 While the Figure 6 flowchart can be implemented entirely in software, 

entirely in hardware or by a combination of hardware and software, the preferred 
embodiment performs most of these calculations in hardware to obtain increased 
speed performance and other advantages. Nevertheless, in other implementations 
(e.g., where a very fast processor is available), some or all of the processing 

25 described herein may be implemented in software to provide similar or identical 
imaging results. 
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Figure 16 illustrates an emulation host system 1201 suitable for use with 
emulator 1303. System 1201 includes a processing unit 1203 and a system 
memory 1205. A system bus 1207 couples various system components including 
system memory 1205 to processing unit 1203. System bus 1207 may be any of 
5 several types of bus structures including a memory bus or memory controller, a 
peripheral bus, and a local bus using any of a variety of bus architectures. System 
memory 1207 includes read only memory (ROM) 1252 and random access 
memory (RAM) 1254. A basic input/output system (BIOS) 1256, containing the 
basic routines that help to transfer information between elements within personal 
10 computer system 1201, such as during start-up, is stored in the ROM 1252. 
O System 1201 further includes various drives and associated computer-readable 
SI media. A hard disk drive 1209 reads from and writes to a (typically fixed) 
ITJ magnetic hard disk 121 1. An additional (possible optional) magnetic disk drive 
SB 1213 reads from and writes to a removable tf floppy" or other magnetic disk 1215. 
* 15 An optical disk drive 1217 reads from and, in some configurations, writes to a 
u removable optical disk 1219 such as a CD ROM or other optical media. Hard disk 
rf H drive 1209 and optical disk drive 1217 are connected to system bus 1207 by a hard 
g disk drive interface 1221 and an optical drive interface 1225, respectively. The 

drives and their associated computer-readable media provide nonvolatile storage of 
20 computer-readable instructions, data structures, program modules, game programs 
and other data for personal computer system 1201 . In other configurations, other 
types of computer-readable media that can store data that is accessible by a 
computer (e.g., magnetic cassettes, flash memory cards, digital video disks, 
Bernoulli cartridges, random access memories (RAMs), read only memories 
25 (ROMs) and the like) may also be used. 
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A number of program modules including emulator 1303 may be stored on 
the hard disk 121 L removable magnetic disk 1215, optical disk 1219 and/or the 
ROM 1252 and/or the RAM 1254 of system memory 1205. Such program 
modules may include an operating system providing graphics and sound APIs, one 
5 or more application programs, other program modules, program data and game 
data. A user may enter commands and information into personal computer system 
1201 through input devices such as a keyboard 1227, pointing device 1229, 
microphones, joysticks, game controllers, satellite dishes, scanners, or the like. 
These and other input devices can be connected to processing unit 1203 through a 
10 serial port interface 1231 that is coupled to system bus 1207, but may be connected 
by other interfaces, such as a parallel port, game port Fire wire bus or a universal 
serial bus (USB). A monitor 1233 or other type of display device is also connected 
to system bus 1207 via an interface, such as a video adapter 1235. 

! System 1201 may also include a modem 1 154 or other network interface 

15 means for establishing communications over a network 1 152 such as the Internet. 
Modem 1 154, which may be internal or external, is connected to system bus 123 
via serial port interface 1231. A network interface 1 156 may also be provided for 
allowing system 1201 to communicate with a remote computing device 1150 (e.g.. 
another system 1201) via a local area network 1 158 (or such communication may 
20 be via wide area network 1 152 or other communications path such as dial-up or 
other communications means). System 1201 will typically include other peripheral 
output devices, such as printers and other standard peripheral devices. 

In one example, video adapter 1235 may include a 3D graphics pipeline chip 
set providing fast 3D graphics rendering in response to 3D graphics commands 
25 issued based on a standard 3D graphics application programmer interface such as 
Microsoft's DirectX 7.0 or other version. A set of stereo loudspeakers 1237 is also 
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connected to system bus 1207 via a sound generating interface such as a 
conventional "sound card" providing hardware and embedded software support for 
generating high quality stereophonic sound based on sound commands provided by 
bus 1207. These hardware capabilities allow system 1201 to provide sufficient 
graphics and sound speed performance to play software stored in storage medium 
62. 

All documents referenced above are hereby incorporated by reference. 

While the invention has been described in connection with what is presently 
considered to be the most practical and preferred embodiment, it is to be 
understood that the invention is not to be limited to the disclosed embodiment, but 
on the contrary, is intended to cover various modifications and equivalent 
arrangements included within the scope of the appended claims. 
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We Claim: 

1 1 . In a graphics system including a graphics processing pipeline that 

2 renders and displays images at least in part in response to primitive vertex data and 

3 texture data, a texture processing system for mapping a texture to a surface of a 

4 rendered image object, said texture processing system comprising: 

5 a texture coordinate/data processing unit that interleaves processing of 

6 logical direct and indirect coordinate data; 

7 a texture data retrieval unit connected to the coordinate/data 

8 processing unit, the texture data retrieval unit retrieving texture data; and 

0 9 a data feedback path from the texture data retrieval unit to the texture 

Q10 coordinate/data processing unit to allow reuse of the texture coordinate/data 

Hj 1 1 processing unit in the same rendering pass; 

§3 12 wherein in response to a set of indirect texture coordinates the 

? 13 retrieval unit recirculates retrieved texture data back to the processing unit for 

M =14 deriving modified texture coordinates which are used in mapping a texture to a 

ni 

|H 15 surface of a rendered image object. 

^ 1 2. The graphics system as set forth on claim 1 wherein the texture 

2 coordinate/data processing unit further comprises a set of hardware control logic 

3 registers coupled to data lines in the pipeline for receiving data and processing 

4 command information used to initiate indirect texture referencing and to control 

5 multiplication and addition operations for deriving said modified texture 

6 coordinates. 

7 3. In a graphics system having a memory containing texture data, a 

8 method of indirect texture referencing comprising the steps of: 

9 (a) using indirect texture coordinates to generate a data triplet; 
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10 (b) deriving texture coordinates based at least in part on the generated 

1 1 triplet; and 

12 (c) using at least the derived texture coordinates to map predetermined 

1 3 texture data onto a primitive. 

1 4. A method of indirect texture referencing as in claim 3 wherein the 

2 using step (a) includes referencing, in a texture memory, an array that maps the 

3 triplet via the indirect texture coordinates. 

1 5. A method of indirect texture referencing as in claim 3 wherein the 

2 using step (a) includes fetching predetermined triplet data from a texture map 
r . 3 stored in memory. 

H 1 6. A method of indirect texture referencing as in claim 3 wherein the 

fy 2 using step (c) includes referencing, in a texture memory, an array that maps color 

go 3 value data via the derived texture coordinates. 

; 1 7. A method of indirect texture referencing as in claim 3 wherein the 

¥>_ 2 deriving step (b) performs a matrix multiplication operation wherein a first matrix 

m 3 comprising a plurality of predetermined constant and/or variable scalar elements is 

Q 4 multiplied by a second matrix comprising a retrieved data triplet. 

1 8. A method of indirect texture referencing as in claim 3 wherein a 

2 triplet comprises s, t and u texture coordinate offsets. 

1 9. A method of indirect texture referencing as in claim 3 wherein a 

2 triplet comprises three eight-bit binary values. 

1 10. A method of indirect texture referencing as in claim 3 wherein a 

2 triplet comprises three five-bit binary values. 
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11. A method of indirect texture referencing as in claim 3 wherein a 
triplet comprises three four-bit binary values. 

12. A method of indirect texture referencing as in claim 3 wherein a 
triplet comprises three three-bit binary values. 

13. A method of indirect texture referencing as in claim 3 wherein a 
quadruplet of indirect texture coordinates are processed together to produce a 
quadruplet of derived texture coordinates for mapping texture data to polygons. 

14. A method of indirect texture referencing as in claim 7 wherein said 
first matrix comprises elements that are a scalar function of one or more 
predetermined texture coordinates. 

15. A method of indirect texture referencing as in claim 7 wherein said 
first matrix is a 3 X 2 matrix comprising six predetermined scalar elements. 

16. A method of indirect texture referencing as in claim 7 wherein said 
first matrix is arranged having elements as follows: 

(si 256 ti 256\ 
0 0 

0 0 

v ) 

wherein s and t are predetermined texture coordinates. 

17. A method of indirect texture referencing as in claim 7 wherein said 
first matrix is arranged having elements as follows: 

( 0 0 ^ 
5/256 r/256 

V 0 0 J 

wherein s and t are predetermined texture coordinates. 
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1 18. In a 3D videographics system having a memory containing texture 

2 data stored in a texture memory, the texture data accessed via either a set of direct 

3 texture coordinates or a set of indirect texture coordinates, a method of indirect 

4 texture referencing for mapping a predetermined texture onto a polygon 

5 comprising the steps of: 

6 (a) using a set of indirect texture-coordinates to retrieve a data triplet 

7 stored in texture memory; 

8 (b) deriving a set of modified texture coordinates based at least in part 

9 on the retrieved data triplet; and 

plO (c) using the set of modified texture coordinates to reference texture 

■.iPi 

Cjl 1 data stored in texture memory corresponding to the predetermined texture. 

fU 1 19. A method of indirect texture referencing as in claim 18 wherein the 

m 2 deriving step (b) includes performing at least one matrix multiplication operation 

s 3 wherein a first matrix comprising a plurality of predetermined constant and/or 

Li. 

y> 4 variable scalar elements is multiplied by a second matrix comprising a retrieved 

m 5 data triplet. 



0 1 20. A method of indirect texture referencing as in claim 18 wherein the 

2 using step (c) includes referencing, in a texture memory, an array that maps color 

3 value data via the derived texture coordinates. 

1 2 1 . A method of indirect texture referencing as in claim 18 wherein a data 

2 triplet comprise at least one texture coordinate offset. 

1 22. A method of indirect texture referencing as in claim 18 wherein a data 

2 triplet comprises s 5 1, and u texture coordinate offset values. 

1 23. A method of indirect texture referencing as in claim 18 wherein a data 

2 triplet comprises three eight-bit binary values. 
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24. A method of indirect texture referencing as in claim 1 8 wherein a data 
triplet comprises three five-bit binary values. 

25. A method of indirect texture referencing as in claim 18 wherein a data 
triplet comprises three four-bit binary values. 

26. A method of indirect texture referencing as in claim 18 wherein a data 
triplet comprises three three-bit binary values. 

27. A method of indirect texture referencing as in claim 19 wherein said 
first matrix comprises elements that are a scalar function of one or more 
predetermined direct texture coordinates. 

28. A method of indirect texture referencing as in claim 19 wherein said 
first matrix is a 3 X 2 matrix comprising six predetermined scalar elements. 

29. A method of indirect texture referencing as in claim 19 wherein said 
first matrix is arranged having elements as follows: 

'si 256 tf 256^ 
0 0 



v 



0 



0 



wherein s and t are predetermined current direct texture coordinates. 

30. A method of indirect texture referencing as in claim 19 wherein said 
first matrix is arranged having elements as follows: 



f 0 0 ^ 
5/256 r/256 



V 



0 0 

wherein s and t are predetermined current direct texture coordinates. 
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1 31. In a graphics system including a graphics processing pipeline that 

2 renders and displays images at least in part in response to polygon vertex data and 

3 texture data stored in a memory, the graphics processing pipeline having a texture 

4 subsystem for accessing and retrieving texture, the texture subsystem comprising a 

5 texture coordinate/data processing unit having: a) at least one binary data 

6 multiplier, at least one binary data accumulator and at least one control register for 

7 receiving instruction codes and/or data to control texture coordinate/data 

8 processing operations, b) a texture data retrieval unit connected to the 

9 coordinate/data processing unit, the texture data retrieval unit retrieving texture 
10 data stored in a texture memory, and c) a data feedback path from the texture data 

0 1 1 retrieval unit to the texture coordinate/data processing unit to recycle retrieved 

y_l2 texture data through the texture coordinate/data processing unit for further 

ftl 13 processing, wherein in response to a set of indirect texture coordinates the retrieval 

ffl 14 unit provides retrieved texture data to the processing unit for deriving modified 

* 15 texture coordinates, a method for controlling the texture subsystem to perform one 

H 16 or more indirect texture referencing operations comprising the step of utilizing a 

H 17 generalized indirect-texture referencing API command function to place 

n 18 appropriate instruction codes and/or data in said control register(s), wherein said 

19 indirect-texture referencing function may be used to at least: 



20 (i) define up to eight textures stored in a texture memory; 

21 (ii) specify up to eight sets of texture coordinates; 

22 (iii) define up to four indirect texture maps; 

23 (iv) specify up to four indirect texture referencing operations to be 



24 performed; 
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25 (v) associate one of said eight textures with each indirect texture map; 

26 and 

27 (vi) associate one of said eight sets of texture coordinates with each 

28 indirect texture maps. 

1 32. In a graphics system having a memory containing texture data, a 

2 method of indirect texture referencing comprising the steps of: 

3 (a) using a set of indirect texture-coordinates to retrieve offset data 

4 from the memory; 

5 (b) multiplying the offset data by predetermined values forming 

O 6 elements of a texture offset matrix to produce a set of texture offset coordinates; 

SI 7 and 

Ffi g 
Fs - 

{ JJ 8 (c) using said set of offset texture coordinates for referencing texture 

9 data stored in the memory when mapping a predetermined texture to a rendered 

!\ 10 polygon. 

hi 1 33. A method of indirect texture referencing as in claim 32 wherein said 

K 2 offset data is a result of a predetermined texturing function and comprises at least 

^ 3 one texture coordinate offset value. 

1 33. A method of indirect texture referencing as in claim 32 wherein said 

2 offset data comprises a set of three values for producing said set of texture offset 

3 coordinates. 

1 34. A method of indirect texture referencing as in claim 33 wherein said 

2 set of three values comprise s, t and u coordinate offset data. 

1 35. A method of indirect texture referencing as in claim 32 wherein said 

2 matrix elements comprise a set of predetermined constants. 
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1 36. A method of indirect texture referencing as in claim 32 wherein said 

2 matrix elements comprise a set of constant and variable values. 

1 37. A method of indirect texture referencing as in claim 32 wherein one or 

2 more elements of said matrix are a mathematical function of one or more 

3 predetermined texture coordinates. 

1 38. A method of indirect texture referencing as in claim 32 wherein the 

2 matrix is a 3 X 2 matrix comprising six predetermined scalar elements. 

1 39. A method of indirect texture referencing as in claim 32 wherein the 

2 matrix is arranged having elements as follows: 

fs/256 f/256") 

3 0 0 

i 0 0 

4 wherein s and t are predetermined current direct texture coordinates. 

1 40. A method of indirect texture referencing as in claim 32 wherein the 

2 matrix is arranged having elements as follows: 

^ 0 0 ^ 
5/256 //256 

V ° ° 

4 wherein s and t are predetermined current direct texture coordinates. 

1 41 . In a graphics system including a graphics engine that renders and 

2 displays images at least in part in response to polygon vertex data and texture data 

3 stored in an associated memory, a texture processing subsystem for selectively 

4 mapping texture data corresponding to one or more different textures and/or 

5 texture characteristics to surfaces of said rendered and displayed images, said 

6 texture processing subsystem including a texture coordinate offset matrix 
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7 arrangement producing a set of offset texture coordinates by multiplying indirect 

8 texture data by elements of a matrix, wherein one or more elements of the matrix 

9 are a mathematical function of one or more predetermined direct texture 
10 coordinates. 

1 42. A texture processing subsystem as in claim 41 wherein the matrix has 

2 six elements in three rows and two columns. 



1 
2 



ry 4 

| l 

Si 2 

ry 3 



l 

2 
3 
4 
5 
6 
7 



43. A texture processing subsystem as in claim 41 wherein the matrix is 



arranged having elements as follows: 



'si 256 t/256^ 
0 0 



V 



0 



0 



J 



wherein s and t are predetermined current direct texture coordinates. 

44. A texture processing subsystem as in claim 41 wherein the matrix is 
arranged having elements as follows: 



f 0 0 > 
5/256 f/256 



V 



0 0 

wherein s and t are predetermined current direct texture coordinates. 

45. In a graphics system including a graphics engine that renders and 
displays images at least in part in response to vertex data and texture data stored in 
an associated memory, a texture processing subsystem for selectively mapping 
texture data corresponding to one or more different textures and/or texture 
characteristics to surfaces of said rendered and displayed images, said texture 
processing subsystem including a texture coordinate offset matrix arrangement for 
producing a set of offset texture coordinates by multiplying indirect texture data by 
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8 elements of a matrix, wherein one or more elements of the matrix can be 

9 selectively loaded. 

1 46. A texture processing subsystem as in claim 45 wherein said elements 

2 are predetermined constants. 

1 47. A texture processing subsystem as in claim 45 wherein one or more of 

2 said elements are a mathematical function of one or more predetermined direct 

3 texture coordinates. 

1 48. A texture processing subsystem as in claim 45 wherein the matrix has 

2 six elements in three rows and two columns. 

yb 1 49. In a graphics system having a memory containing texture data, a 

ry 2 method of indirect texture referencing comprising the steps of: 

^ 3 (a) using a set of indirect texture coordinates to retrieve texture offset 

W 4 data from memory, said offset data being a result of a predetermined texturing 

M= 5 function and comprising at least one texture coordinate offset value; 

jj 6 (b) multiplying said texture offset data by predetermined constant or 

y 7 variable scalar factors; and 

8 (c) using said set of offset texture coordinates for referencing 

9 predetermined texture data in memory. 

1 50. A method of indirect texture referencing as in claim 49 wherein said 

2 offset data comprise a set of three values for producing said set of texture offset 

3 coordinates. 

1 5 1 . A method of indirect texture referencing as in claim 49 wherein said 

2 offset data comprise a set of three eight-bit binary values. 
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1 52. A method of indirect texture referencing as in claim 49 wherein said 

2 offset data comprise a set of three five-bit binary values. 

1 53. A method of indirect texture referencing as in claim 49 wherein said 

2 offset data comprise a set of three four-bit binary values. 

1 54. A method of indirect texture referencing as in claim 49 wherein said 

2 offset data comprise a set of three three-bit binary values. 

1 55. In a graphics system having a memory containing texture data stored 



2 in a texture memory, the texture data accessed using either a set of direct texture - 

3 coordinates or a set of indirect texture-coordinates, a method of implementing 

4 multiple levels of indirection during indirect texture referencing for mapping a 
">j 5 texture onto a primitive, comprising the steps of: 



ri 6 (a) using a set of indirect texture-coordinates to retrieve data triplets 

51 7 stored in texture memory; 

U 8 (b) deriving a set of modified texture coordinates based at least in part 

ry 9 on the retrieved data triplets; 

O 10 (c) using the set of modified texture coordinates for retrieving data 

1 1 stored in texture memory; 

12 (d) reiteratively repeating steps (b) and (c) for a predetermined 

13 number of data retrievals; and 

14 (e) using a set of derived texture coordinates resulting from step (d) to 

1 5 map predetermined texture data onto the primitive. 

1 56. In a graphics system including at least one texture mapping unit, a 

2 method of operating the texture mapping unit to provide a plurality of logical 

3 texture mapping stages comprising: 
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4 (a) presenting a first set of texture mapping parameters to the texture 

5 mapping unit; 

6 (b) controlling the texture mapping unit to perform a first texture mapping 

7 operation in response to said parameters presented by step (a); 

8 (c) recirculating results of step (b) to develop a further set of texture 

9 mapping parameters; and 

10 (d) presenting said further set of parameters to the texture mapping unit and 

1 1 controlling said texture mapping unit to perform a further texture mapping 

1 2 operation in response to said further set of parameters. 

1 57. The method of claim 56 wherein step (b) includes performing an 

2 indirect texture mapping operation that develops a data set, recirculating step (c) 

3 comprises developing a further set of texture mapping parameters responsive to 

4 said developed data set; and step (d) includes performing a direct texture mapping 

5 operation in response to the further set of texture mapping parameters. 

1 58. The method of claim 56 wherein the developing step develops said 

2 further set of texture mapping parameters by combining a further set of direct 

3 texture coordinates with said data set. 
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Abstract Of The Disclosure 

A graphics system including a custom graphics and audio processor 
produces exciting 2D and 3D graphics and surround sound. The system includes a 
graphics and audio processor including a 3D graphics pipeline and an audio digital 
signal processor. The graphics pipeline renders and prepares images for display at 
least in part in response to polygon vertex attribute data and texel color data stored 
as a texture images in an associated memory. An efficient texturing pipeline 
arrangement achieves a relatively low chip-footprint by utilizing a single texture 
coordinate/data processing unit that interleaves the processing of logical direct and 
indirect texture coordinate data and a texture lookup data feedback path for 
"recirculating" indirect texture lookup data retrieved from a single texture retrieval 
unit back to the texture coordinate/data processing unit. Versatile indirect texture 
referencing is achieved by using the same texture coordinate/data processing unit 
to transform the recirculated texture lookup data into offsets that may be added to 
the texture coordinates of a direct texture lookup. A generalized indirect texture 
API function is provided that supports defining at least four indirect texture 
referencing operations and allows for selectively associating one of at least eight 
different texture images with each indirect texture defined. Retrieved indirect 
texture lookup data is processed as multi-bit binary data triplets of three, four, five, 
or eight bits. The data triplets are multiplied by a 3 X 2 texture coordinate offset 
matrix before being optionally combined with regular non-indirect coordinate data 
or coordinate data from a previous cycle/stage of processing. Values of the offset 
matrix elements are variable and may be dynamically defined for each cycle/stage 
using selected constants. Two additional variable matrix configurations are also 
defined containing element values obtained from current direct texture coordinates. 
Circuitry for optionally biasing and scaling retrieved texture data is also provided. 
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